METHODS TO IDENTIFY AGENTS MODULATING 
FUNCTIONS OF POLYPEPTIDE GALNAC-TRANSFERASES, 
PHARMACEUTICAL COMPOSITIONS COMPRISING SUCH AGENTS AND 
THE USE OF SUCH AGENTS FOR PREPARING MEDICAMENTS 

5 This application claims priority from U.S. Provisional Application Serial No. 

60/425,204, filed November 8, 2002, and from International Application PCT/DK03/ 

(Attomey docket number 04305/200H154-WO1), filed November 6, 2003, each of which 
is hereby incorporated by reference in its entirety. 

TECHNICAL FIELD 

10 The present invention relates generally to the biosynthesis, sorting and secretion of 

mucins, 0-glycosylated glycoproteins, and glycoproteins. More specifically, it relates to 
modulation of the fimctions of a homologous family of UDP-A^-acetyl-a-D-galactosamine: 
polypeptide A^-acetylgalactosaminyltransferases (GalNAc-transferases), which are 
generally characterized by the enzyme activity to add iV-acetylgalactosamine (GalNAc) to 

15 the hydroxy group of serine and threonine amino acid residues in peptides, glycopeptides, 
proteins, and glycoproteins. 

In particular, this invention concerns a method of inhibiting or modulating 
fimctions mediated by lectin domains of polypeptide GalNAc-transferases. More 
particularly, this invention is related to a lectin domain found in the C-terminal region of 

20 most GalNAc-transferases, which is structurally and fimctionally distinct from the catalytic 
domain of the enzymes and can be expressed in a fimctional form as a truncated or mutated 
protein that is enzymatically inactive. Still more particularly, this invention is related to the 
lectin domains of more than 16 GalNAc-transferases, designated GalNAc-Tl to GalNAc- 
T16. 

25 Further, this invention concems a method of screening one or more test substances 

for the ability to inhibit or modulate polypeptide GalNAc-transferase lectins in a cell-free 
or cell-based assay, in order to find agents which are effective in binding to one or more 
lectin domains of polypeptide GalNAc-transferases, for example, selective inhibitors of the 



binding properties of the above mentioned lectin domains and selective inhibitors of the 
effects that these lectin domains exert on intracellular transport, transport to cell surface, 
and secretion of mucins, glycoproteins, and proteins. 

Even further, this invention provides a preferable inhibitor, GalNAcp-benzyl, 
5 which is a novel inhibitor and representative of a novel group of inhibitors which display 
the common characteristic of selectively inhibiting lectins of polypeptide GalNAc- 
transferases in direct binding assays and not serve as substrates for other 
glycosyltransferases involved in 0-glycan biosyntheses, while exhibiting inhibitory activity 
of secretion and intracellular transport of mucins and glycoproteins in cells. GalNAcP- 
10 benzyl and related inhibitors with the same biological functions represent preferable 

selective inhibitor compared to GalNAca-benzyl because these do not serve as substrates 
for glycosyltransferases extending 0-glycans and do not provide a general inhibition of the 
0-glycosylation process in cells. 

BACKGROUND OF THE INVENTION 

15 Mucin-type O-glycosylation, one of the most abundant forms of protein 

glycosylation, is found on secreted and cell surface associated glycoproteins of all 
eukaryotic cells except yeast. Mucin-type 0-glycans contribute to a number of important 
molecular functions, including: direct effects on protein conformation, solubility, and 
stability; specific receptor fimctions that regulate cell trafficking and cell-cell interactions; 

20 and microbial clearance. Mucin-type O-glycans are synthesised in the Golgi through the 
sequential addition of saccharide residues, first to hydroxyl groups on serines and 
threonines of a protein core and subsequently to hydroxyl groups on the growing 
saccharide chains that extend from the protein core. There is great diversity in the structures 
created by 0-glycosylation (himdreds of potential structures), which are produced by the 

25 catalytic activity of himdreds of glycosyltransferase enzymes that are resident in the Golgi 
complex. Diversity exists at the level of the glycan structure and in positions of attachment 
of 0-glycans to protein backbones. Despite the high degree of potential diversity, it is clear 
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that 0-glycosylation is a highly regulated process that shows a high degree of conservation 
among multicellular organisms. 

The factors that regulate the attachment of 0-glycans to particular protein sites and 
their extension into specific structures are poorly understood. Longstanding hypotheses in 
5 this area propose that mucin-type O-glycosylation occurs in a stochastic manner where 
structure of acceptor proteins combined with topology and kinetic properties of resident 
Golgi glycosyltransferases determine the order and degree of glycosylation \ This concept 
does not fully explain the high degree of regulation and specialisation that governs the O- 
glycosylation process. In particular it is difficult to envision how large mucin molecules 

10 with high densities of 0-glycans are glycosylated in the Golgi by stochastic mechanisms 
that also create other sparsely glycosylated proteins. 

The first step in mucin-type 0-glycosylation is catalysed by one or more members 
of a large family of UDP-GalNAc: polypeptide TV^acetylgalactosaminyltransferases 
(GalNAc-transferases) (EC 2.4.1.41), which transfer GalNAc to serine and threonine 

15 acceptor sites . To date twelve members of the mammalian GalNAc-transferase family 
have been identified and characterized ^, and several additional putative members of this 
gene family have been predicted firom analysis of genome databases. The GalNAc- 
transferase isoforms have different kinetic properties and show differential expression 
patterns temporally and spatially, suggesting that they have distinct biological functions ^. 

20 Sequence analysis of GalNAc-transferases have led to the hypothesis that these enzymes 
contain two distinct subunits: a central catalytic unit, and a C-terminal unit with sequence 
similarity to the plant lectin ricin, designated the "lectin domain" Previous experiments 
involving site-specific mutagenesis of selected conserved residues confirmed that 
mutations in the catalytic domain eUminated catalytic activity. In contrast, mutations in the 

25 "lectin domain" had no significant effects on catalytic activity of the GalNAc-transferase 
isoform, GalNAc-Tl ^. Thus, the C-terminal "lectin domain" was believed not to be 
functional and not to play roles for the enzymatic functions of GalNAc-transferases ^, 

However, recent evidence demonstrates that some GalNAc-transferases exhibit 
unique activities with partially GalNAc-glycosylated glycopeptides. The catalytic actions 
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of at least three GalNAc-transferase isoforms, GalNAc-T4, -T7, and -TIO, selectively act 
on glycopeptides corresponding to mucin tandem repeat domains where only some of the 
clustered potential glycosylation sites have been GalNAc glycosylated by other GalNAc- 
transferases \ GalNAc-T4 and -T7 recognize different GalNAc-glycosylated peptides 
5 and catalyse transfer of GalNAc to acceptor substrate sites in addition to those that were 
previously utilized. One of the functions of such GalNAc-transferase activities is predicted 
to represent a control step of the density of 0-glycan occupancy in mucins and mucin-like 
glycoproteins with high density of 0-glycosylation. It was hypothesized that such 
sequential actions of multiple GalNAc-transferase isoforms may be required to complete 
10 0-glycan attachments to some mucin peptide sequences allowing for detailed control of 
density. 

One example of this is the glycosylation of the cancer-associated mucin MUCl. 
MUCl contains a tandem repeat 0-glycosylated region of 20 residues 
(HGVTSAPDTRPAPGSTAPPA) with five potential 0-glycosylation sites. GalNAc-Tl, 

15 -T2, and -T3 can initiate glycosylation of the MUCl tandem repeat and incorporate at only 
three sites (HGVtSAPDTRPAPGstAPPA, GalNAc attachment sites in lower-case). 
GalNAc-T4 is unique in that it is the only GalNAc-transferase isoform identified so far that 
can complete the 0-glycan attachment to all five acceptor sites in the 20 amino acid tandem 
repeat sequence of the breast cancer associated mucin, MUCl. GalNAc-T4 transfers 

20 GalNAc to at least two sites not used by other GalNAc-transferase isoforms on the 

GalNAc4TAP24 glycopeptide (tAPPAHGVtSAPDTRPAPGstAPP . GalNAc attachment 
sites in lower case) An activity such as that exhibited by GalNAc-T4 appears to be 
required for production of the glycoform of MUCl expressed by cancer cells where all 
potential sites are glycosylated Normal MUCl fix)m lactating mammary glands has 

25 approximately 2.6 0-glycans per repeat and MUCl derived firom the cancer cell line 
T47D has 4.8 0-glycans per repeat The cancer-associated form of MUCl is therefore 
associated with higher density of 0-glycan occupancy and this is accomplished by a 
GalNAc-transferase activity identical to or similar to that of GalNAc-T4. 
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The specific mechanism by which GalNAc-T4, -T7, and -TIO recognize and 
function with GalNAc-glycosylated glycopeptides is not known. However, it was originally 
demonstrated that the GalNAc-glycopeptide specificity exerted by GalNAc-T4 is directed 
or at least dependent on its lectin domain. A smgle amino acid substitution in the T4 lectin 
5 domain predicted to inactivate its fiinction abolished the GalNAc-glycopeptide specificity 
of T4 without adversely affecting the basic catalytic mechanism of the transferase ^. This 
suggests that the lectin domain interacts with GalNAc-glycopeptides and confers a novel 
catalytic function to the enzyme protein. Despite extensive attempts it has in the past not 
been possible to demonstrate actual binding of the transferase and lectin to sugars and 

10 glycopeptides, but it was possible to demonstrate selective inhibition of the GalNAc- 
glycopeptide activity of GalNAc-T4 using 230 mM concentration of GalNAc ^. The 
present inventors and coworkers demonstrated that mM concentrations of GalNAca-benzyl 
can inhibit the lectin mediated GalNAc-glycopeptide substrate specificity of GalNAc-T4 as 
well as -T7 (PCT WO 01/85215 A2).Polypeptide GalNAc-transferases, which have not 

15 displayed apparent GalNAc-glycopeptide specificities, also appear to be modulated by their 
putative lectin domains (PCT WO 01/85215 A2). Recentiy, it was found that mutations in 
the GalNAc-Tl putative lectin domain, similarly to those previously analysed in GalNAc- 
T4 , modified the activity of the enzyme in a similar fashion as GalNAc-T4. Thus, while 
wild type GalNAc-Tl added multiple consequtive GaDSTAc residues to a peptide substrate 

20 with multiple acceptor sites, mutated GalNAc-Tl failed to add more than one GalNAc 
residue to the same substrate The mechanism is however not understood. 

Glycosylation confers physico-chemical properties including protease resistance, 
solubility, and stabihty to proteins Glycosylation furthermore confers changes in 
immunological responses to proteins and glycoproteins. 0-glycosylation on mucins and 

25 mucin-like glycoproteins protect these molecules found in the extracellular space and body 
fluids fi-om degradation. Control of 0-glycosylation with respect to sites and number 
(density) of O-glycan attachments to proteins as well as control of the 0-glycan structures 
made at specific sites or in general on glycoproteins, is of interest for several purposes. 
Diseased cells e.g. cancer cells often dramatically change their 0-glycosylation and the 
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altered glycans and glycoproteins may constitute targets for therapeutic and diagnostic 
measures Mucins functioning in body fluids may have different properties depending 
on density and structure of 0-glycans attached in protection against disease, including 
infections by microorganisms. Furthermore, mucins with different glycosylation may 
5 change physico-chemical properties including stability and solubiUty properties that may 
influence turnover and removal of mucous. A number of lung diseases, e.g. cystic fibrosis, 
asthma, chronic bronchitis, smokers lungs, are associated with symptomatic mucous 
accumulation and it is likely that the nature and structure of mucins play a role in the 
pathogenesis of such diseases. 

10 Partial inhibitors of 0-glycosylation in cells have been reported. Aryl-A^acetyl-a- 

galactosaminides such as benzyl-, phenyl-, and j[7-nitrophenyl-GalNAc were originally 
found to inhibit the second step in 0-glycosylation, the O-glycan processing step, by 
inhibiting synthesis of core 1 (Galpl-3GalNAcal-R) and more complex structures 
Benzyl-aGalNAc was also found to inhibit sialylation. It is generally believed that the 

15 downstream effects of benzyl-aGalNAc treatment are mediated by substrate competition of 
biosynthetic glycosylation products of benzyl-aGalNAc. Thus, e.g. the immediate 
glycosylation product of benzyl-aGalNAc is Galpl-3GaINAca-benzyl and this serves as 
an efficient substrate for the core 1 a2-3sialyltransferase ST3Gal-I ^'^'^^ GalNAca-benzyl 
has been the most widely used inhibitor of 0-glycosylation, but it has only been used in cell 

20 culture as effective treatment concentrations lead to intracellular build-up of vesicles with 
GalNAca-benzyl products and treated cells change morphology and growth characteristics 

26 

Treatment of cells with benzyl-aGalNAc inhibit O-glycan processing and affect 
apical sorting of some O-glycosylated proteins The mechanism for this is generally 
25 believed to be through inhibition of sialylation Inhibition of mucin secretion has also 
been observed in culture cells, more specifically HT29 MTX cells, but this effect is not 
generally foxmd in mucin secreting cells 

True inhibitors of 0-glycosylation, i.e. inhibitors of the initial O-glycan attachment 
process governed by polypeptide GalNAc-transferases have not been identified. Benzyl- 
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aGalNAc has been shown to exhibit inhibitory effect on the GalNAc-glycopeptide 
substrate specificity of some polypeptide GalNAc-transferases in test tube enzyme assays 
(PCX WO 01/85215 A2), however the effect of this compound in vivo on density of O- 
glycosylation is unknown. 
5 Inhibitors of the initiating step in 0-glycosylation could completely or selectively 

block attachment of 0-glycans to 0-glycosylation sites in proteins. Compounds inhibiting 
the catalytic function of a selected subset of the polypeptide GalNAc-transferase family 
may be predicted to only lead to partial inhibition of 0-glycosylation capacity of cells. 
Proteins with no or little O-glycosylation may have entirely different biological properties 

10 than their normal glycosylated coxmterparts. Complete inhibition of O-glycosylation is not 
desirable because of the many diverse functions of 0-glycans, and it is expected to result in 
cell death. Selective inhibition of O-glycosylation on the other hand is desirable in many 
cases such as cancer cells producing glycoproteins and mucins with denser O-glycosylation 
than normal cells. For example breast cancer cells appear to hyperglycosylate the cancer- 

15 associated cell surface mucin MUCl compared to glycosylation in normal cells The 
overexpression of MUCl and hyperglycosylation found in cancer cells are likely to be 
important for the pathobiology of cancers. Methods of inhibiting the hyperglycosylation of 
mucins in cancer cells are desirable. 

It is apparent from the above that inhibitors in the prior art interfere with 0-glycan 

20 processing, i.e. the glycosylation process that extend GalNAc residues directly attached to 
proteins at serine and threonine residues. Existing inhibitors of O-glycosylation are not 
suitable for therapeutic treatment in mammals including man as they profoundly affect O- 
glycosylation processing as well as lead to undesired morphological and growth effects on 
culture cells. 

25 Consequently, there exists a need in the art for therapeutic compounds that display 

selectively and limited inhibition of O-glycosylation without generally affecting the process 
of O-glycosylation. The present invention meets these needs, and further presents other 
related advantages. 
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SUMMARY OF THE INVENTION 

The present invention provides for isolated polypeptide lectin domains of GalNAc- 
transferases. The present invention also provides for mammalian GalNAc-transferase lectin 
domains, lectin-fimctional variants and fragments thereof. The present invention also 
5 provides for human GalNAc-Tl-T16 lectin domains having the sequences disclosed in 
Table m herein. 

The present invention also provides for isolated nucleic acids encoding lectin 
domains of GalNAc-transferases. These nucleic acid sequences can encode for mammalian 
GalNAc-transferase lectin domains, lectin-functional variants and fragments thereof The 

10 present invention also provides for isolated nucleic acids encoding human GalNAc-Tl-T16 
(these sequences are disclosed in Table m, herein). 

The present invention ftirther provides a method of inhibiting or modulating 
specific fimctions mediated by lectin domains of polypeptide GalNAc-transferases 
comprising administering an effective amount of an appropriate agent which is effective in 

15 binding to one or more lectin domains of polypeptide GalNAc-transferases and inhibiting 
ftmctions mediated by said lectin domains. 

The present invention specifically discloses a novel selective inhibitor of 
polypeptide GalNAc-transferase lectin domains, which provides a major advancement in 
that this inhibitor and related inhibitors sharing common characteristics of activity binds 

20 lectin domains without serving as acceptor substrate for glycosyltransferases involved in 
synthesis of 0-glycans. For this reason treatment of cells with this and related compounds 
is not expected to affect 0-glycosylation extension including 0-glycan branching and 
sialylation. A preferred embodiment of this invention is a selective inhibitor of GalNAc- 
transferase lectins that does not serve as substrates for other glycosyltransferases. 

25 In a preferred embodiment, the novel selective GalNAc-transferase lectin inhibitor 

is represented by the P-anomeric configuration of GalNAc-benzyl, GalNAcp-benzyl. 
Additional preferred selective inhibitors including but not limited to the following: 
GalNAcpl-R (R represents any aglycone such as benzyl, phenyl, p-nitrophenyl, or 
umbrelliferyl, without limitation), selective GalNAcp-peptides, mimetics or compounds 
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derived herefrom. The identifying characteristics of the preferred selective inhibitor is that 
it retains the binding activity for lectin domains similar to GalNAcp-benzyl and GalNAca- 
benzyl as determined in binding assays as disclosed in the present invention, and lacks the 
ability to serve as acceptor substrate for glycosyltransferases involved in 0-glycosylation as 
5 determined by standard enzyme assays widely known by the skilled in the art. 

The lectin domains confer unique properties to the GalNAc-transferases including 
but not limited to selective GalNAc-glycopeptide substrate specificity, as well as binding 
properties for peptides and carbohydrates to enhance catalytic properties and other 
functions related to the 0-glycosylation process. In a preferred embodiment, methods of 
10 selectively blocking the lectin mediated acceptor substrate specificities of such GalNAc- 
transferase isoforms by pGalNAc and pGalNAc containing structures are disclosed. 

The present invention provides a novel method for large scale screening of test 
substances for the ability to inhibit lectin-mediated activity of polypeptide GalNAc- 
transferases in a cell-free assay, which comprises: 

15 (i) contacting an isolated polypeptide GalNAc-transferase, an isolated lectin domain 
from a polypeptide GalNAc-transferase, or a fragment of a polypeptide GalNAc- 
transferase capable of diplaying lectin-mediated binding on its substrate, with one 
or more test substances under assay conditions suitable for the detection of said 
binding ability; and 

20 (ii) measuring whether said lectin-mediated activity is thereby inhibited or modulated 
by one or more of the substances. 

The present invention also provides a method of screening one or more test 
substances for the ability to inhibit or modulate intracellular transport and/or cell surface 
expression of mucins, 0-glycosylated glycoproteins, glycoproteins and proteins in a cell- 
25 based assay, which comprises: 

(i) contacting a cell that expresses mucins, 0-glycosylated glycoproteins, 

glycoproteins and proteins, with one or more test substances under assay conditions 
suitable for the detection of inhibition or modulation of said expression; and 
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(ii) measuring whether intracellular transport and cell surface expression of said 
mucins, 0-glycosylated glycoproteins, glycoproteins and proteins are thereby 
inhibited or modulated by one or more of the substances. 

The present invention also provides a method of screening one or more test 
substances for the ability to inhibit or modulate secretions of mucins, 0-glycosylated 
glycoproteins, glycoproteins and proteins in a cell-based assay, which comprises: 

(i) contacting a cell that secretes mucins, 0-glycosylated glycoproteins, 
glycoproteins with one or more test substances under assay conditions suitable 
for the detection of inhibition or modulation of said secretion; and 

(ii) measuring whether secretion of said mucins, 0-glycosylated glycoproteins, 
glycoproteins and proteins are thereby inhibited or modulated by one or more of 
the substances. 

Substances identified as agents which are effective in binding to one or more lectin 
domains of polypeptide GalNAc-transferases using the above method of the invention may 
e.g. be selected from the group consisting of naturally or non-naturally occurring 
carbohydrates, peptides, glycopeptides, glycoconjugates and portions and fragments 
thereof They may also be found among nucleic acids as well as small organic or inorganic 
molecules. They include but are not limited to peptides such as soluble peptides including 
Ig-tailed fusion peptides, members of random peptide libraries and combinatorial 
chemistry-derived molecular libraries made of D- and/or L-configuration amino acids, 
phosphopeptides (including members of random or partially degenerate, directed 
phosphopeptide libraries), antibodies [e.g. polyclonal, monoclonal, humanized, anti- 
idiotypic, chimeric, single chain antibodies, fragments, (e.g. Fab, F(ab)2, and Fab 
expression library fragments, and epitope-binding fragments thereof)], and polypeptides. A 
substance to be used as an agent according to the invention may be an endogenous 
physiological compound or it may be a natural or synthetic compound. 

Agents in accordance with the present invention are useful for changing the density 
and sites of 0-glycan occupancy in mucins and 0-linked glycoproteins. Fiulher uses are in 
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changing Golgi-transport and intracellular sorting events conferred by the lectin domains of 
GalNAc-transferases. For example, inhibitors of lectin domains of GalNAc-transferases 
may be useful in manipulating disease associated 0-glycosylation to augment immunity 
and to prepare vaccines. Further use may be found in manipulating mucin secretion and O- 
5 glycan density in diseases associated with mucous accumulation to decrease secretion and 
enhance clearance of mucins. Further use may entail modulating 0-glycosylation of 
recombinant glycoproteins by inhibition of polypeptide GalNAc-transferases in host 
expression cells. These and other aspects of the present invention will become evident upon 
reference to the following detailed description and drawings. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic of the predicted domain structure of polypeptide GalNAc- 
transferases Polypeptide GalNAc-transferases are predicted to be type n transmembrane 
proteins with a cytoplasmic N-terminal tail, a non-cleaved hydrophobic transmembrane 
15 retention signal (TM, grey box), a stem region of variable size, a well conserved catalytic 
unit of approximately 340 amino acid residues, and a poorly conserved C-terminal domain 
of approximately 130 amino acids which display structural similarity to the lectin ricin and 
have been designated the "lectin domain". The putative lectin domains consist of a-, p-, 
and y- ricin-like repeats. C indicates conserved cysteine residues among sequences. 

20 

Figure 2 is a multiple sequence alignment (ClustalW) of putative lectin domains 
derived from 16 human polypeptide GalNAc-transferases. Potitions of conserved motifs 
CLD and QxW in the a, p, and y repeats are indicated. The numbering indicated in the 
margins reflects numbering of the analysed sequence region of each GalNAc-transferase. 
25 Conserved residues are indicated by black box'ing. 

Figures 3 A and B are schematic representations of human soluble secreted MUCl 
expression constructs used for stable transfectants of CHO cells. Panel A: IgG2A His-tag 
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was inserted into Bsu36I/XbaI site of MUCIFL, generating a His-tagged MUCl construct 
containing the endogenous MUCl secretion signal peptide. Panel B: MuclFL Sau3AI 
insert was inserted into the BamHI site of pcDNA-inf., generating a non-tagged MUCl 
construct containing the y-interferon secretion signal peptide. 

Figure 4 is a plot of absorbance v. Ugand dilution showing GalNAc-transferase 
binding to GalNAc-MUCl glycopeptide. A direct binding assay (ELISA) mediated by the 
lectin domain was developed and validated with soluble secreted GalNAc-T4 and -T2 
enzyme proteins. ELISA plates were coated with peptides or glycopeptides at 1 |ag/ml, 
blocked with BSA, and incubated with biotinylated enzymes. After washing, bound 
enzyme proteins were detected with HRP-Streptavidin as described in detail in Example 2. 
Secreted soluble constructs of GalNAc-transferases which are enzymatically active may 
bind to (glyco)peptide substrates through their catalytic units as originally described for 
GalNAc-T2 However, GalNAc-transferase binding to substrates by the catalytic domain 
requires UDP and divalent cat-ions (binding destroyed by EDTA treatment), in accordance 
with previous experience Panel A: GalNAc-T4 wild type enzyme proteins (□) and 
GalNAc-T2 (■) selectively bind GalNAc-MUCl glycopeptides, with no significant 
binding observed to the unglycosylated peptide (GalNAc-T4 wt (A) and GalNAcT2 (A)). 
Panel B: Furthermore, the GalNAc-T4 lectin mutant did not bind to either glycosylated 
Mucl (GalNAc-Mucl) (•) or non glycosylated Mucl (Mucl) (o), whereas GalNAc-T4 
wild type binds GalNAc-Mucl (□) but not nonglcosylated Mucl (■). Binding was not 
affected by 10 mM EDTA. Soluble secreted GalNAc-T4 mutant, GalNAc-T4^^^" ^ in 
which the lectin domain has been selectively inactivated by a single amino acid 
substitution, showed no binding demonstrating that the binding observed with the wild type 
enzyme is mediated through the lectin domain. 

Figure 5 is a plot (absorbance at 495 v. concentration of inhibitor) showing 
inhibition of GalNAc-T4 lectin binding. Direct binding assays were performed with 
preincubation of GalNAc-T4 with inhibitors followed by incubation of GalNAc-T4 in 
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ELISA plates activated with GalNAc-Mucl as described in detail in Example 2. GalNAca- 
benzyl (■) as well as GalNAcp-benzyl (□) inhibit at 3-6 mM, whereas the control 
GlcNAca-benzyl (A) showed no inhibition. This demonstrates that the GalNAc- 
transferase lectin domains show no specificity for the anomeric configuration of GalNAc, 
5 and identifies a novel inhibitor, GalNAcP-benzyl, of GalNAc-transferase lectins. 

Figures 6 A-J are a series of photomicrographs showing immunostaining of wild 
type CHO and transfected wild type CHO cells with a secreted MUCl construct. 
CH01dlD/MUCsol-cloneD5 was estabUshed fi-om this population. MUCl expression in the 
10 cytoplasm of 10-20 % cells is visualized by HMFG2, SMS, and vu-4H5 antibodies. Anti-T 
antibody HH8 reacted only after neuraminidase pretreatment and the anti-Tn antibody 
reacted similarly before and after neuraminidase treatment. This suggests that cells grown 
in GalNAc alone produce mainly the Tn glycoform of MUCl, while cells grown in Gal and 
GalNAc produce mainly the sialylated T (core 1) glycoforms. 

15 

Figure 7 is a series of SDS-PAGE Westem analysis of MUCl secreted fi-om wild 
type CHO cells stably transfected cells with a secreted MUCl construct 
(CHOldlD/MUCsol-cloneDS). Neu + indicates pretreatment of samples with 
neuraminidase as described in Examples, Cells were grown in culture medium after 24 or 
20 48 hours analysed directly or after neuraminidase treatment. 

Figure 8 A-X (left to right fi-om top to bottom) is a series of photomicrographs 
showing immunostaining of CHO IdlD cells stably transfected cells with a fiill coding cell 
surface secreted MUCl construct. CH01dlD/MUClF-clone2 cells were grown in Optimem 
25 medium without and with 1 mM GalNAc, and 1 mM GalNAc plus 0.1 mM Gal for 24-48 
hours. Cells were trypsinized, washed, air-dried on cover slides, and immunostained as 
described in Examples with antibodies to MUCl and T and Tn carbohydrates. Reactivity 
was evaluated before and after neuraminidase treatment of dried acetone fixed cells. +/- neu 
indicates that the staining was identical with or without neuraminidase pretreatment. 
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Figures 9A and B are SDS-PAGE Western analysis of MUCl secreted from CHO 
IdlD cells stably transfected cells with a secreted MUCl construct. CHOldlD/MUCsol- 
cloneDS cells were grown in the presence or absence of sugars indicated, and samples of 
5 the culture medium analysed directly after 24-48 hours. Positive control (GalNAc-peptide) 
is a 60-mer MUCl tandem repeat GalNAc-glycopeptide glycosylated with hxmian 
polypeptide GalNAc-transferase GalNAc-T2. Lane labeled control includes medium from 
CHO IdlD cells. Anti-MUCl monoclonal antibodies 5E5 and HMFG2 were used. 

10 Figures lOA and B are SDS-PAGE Western analysis of MUCl secreted from 

CHO IdlD cells stably transfected cells with a secreted MUCl constmct. Same experiment 
as Figure 9 using anti-MUCl monoclonal antibodies VU-4H5 and VU-2G7. 

Figures llA-D are a series of photomicrographs (left to right from top to bottom 
15 of) anti-MUCl antibody immunofluorescense staining of CHO IdlD cells stably transfected 
with a fiill coding cell surface MUCl (CH01dlD/MUClF-clone2). Cell grown in the 
presence of GalNAc were treated with the 0-glycosylation inhibitor GalNAca-benzyl or 
control GlcNAca-benzyl. Cells were grown in plates and stained without permeabilization 
as described in Example 4, 

20 

Figure 12 is an SDS-PAGE Western analysis of GalNAca-benzyl inhibition of 
MUCl expression in CHO IdlD cells stably transfected with a fiiU coding MUCl construct. 
Cells were grown for 24 hours (lanes 1-6) or 48 hours (lanes 7-12) in the presence of 1 mM 
GalNAc (lanes 1-3 and 7-9) or 1 mM GalNAc and 0.1 mM Gal (lanes 4-6 and 10-12) to 
25 limit core 0-glycosylation to GalNAcal-O-Ser/Thr and Galpl-3GalNAcal-0-Ser/Thr, 
respectively. Cells were fiirther treated with 2 mM GalNAca-benzyl (lanes 1, 4, 7, 10), 2 
mM GlcNAca-benzyl (lanes 2, 5, 8, 1 1) or no inhibitor (lanes 3, 6, 9, 12). Cells were 
washed and lysed at 24 or 48 hoiu^ and the lysates subjected to immunoprecipitation with 
monoclonal antibody HMFG2, which broadly recognize MUCl glycoforms. 
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Immunoprecipitates were analysed by SDS-PAGE and western blot using HMFG2 
antibody to detect MUCl expression. Lane M indicates molecular markers with assigned 
mw. Lane C includes a control MUCl 180-mer tandem repeat peptide which has been 
GalNAc-glycosylated with 3 moles GalNAc per repeat using GalNAc-T2. The sharp bands 
5 migrating at 100-200 kd are immunoglobulins indicated by IgG. At 24 hours the MUCl 
glycoforms expressed by cells grown in GalNAc or Gal and GalNAc migrated similarly, 
indicating that synthesis of sialylated core 1 0-glycans were time limited (lanes 1-6). At 48 
hours, MUCl glycoforms migrating as higher molecular weight species were expressed by 
cells grown in Gal and GalNAc (lanes 11-12). Treatment with GalNAca-benzyl had no 

10 significant effect at 24 hours (lanes 1 and 4), but after 48 hours a significant reduction in 
MUCl expression was found in cells grown in GalNAc as well as in Gal and GalNAc 
(lanes 7 and 10). In the latter case a significant shift in migration fiirther confirmed that 
GalNAca-benzyl also serves as an inhibitor of 0-glycan extension and reduces O- 
glycosylation to GalNAcal-O-Ser/Thr. GlcNAca-benzyl served as a control and had no 

15 effect on MUCl expression and 0-glycosylation compared to untreated cells (lanes 8 and 
11). 

Figure 13 is a SDS-PAGE Western analysis resulting fi-om the same experiment as 
in Figure 12, but using a novel monoclonal antibody, 5E5, to visualize MUCl expression. 

20 Cells were grown for 48 hours in the presence of 1 mM GalNAc (lanes 1-3) or 1 mM 
GalNAc and 0.1 mM Gal (lanes 4-6) to limit core 0-glycosylation to GalNAcal-0- 
Ser/Thr and Galpl-3GalNAcal-0-Ser/Thr, respectively. Cells were fiirther treated with 2 
mM GalNAca-benzyl (lanes 1, 4), 2 mM GlcNAca-benzyl (lanes 2, 5) or no inhibitor 
(lanes 3, 6). Cells were washed and lysed at 48 hours and the lysates subjected to 

25 immunoprecipitation with monoclonal antibody HMFG2, which broadly recognize MUCl 
glycoforms. Immunoprecipitates were analysed by SDS-PAGE and westem blot using 5E5 
antibody, which selectively recognize GalNAc-glycosylated MUCl expression and show 
no reactivity with unglycosylated MUCl peptides. Lanes M and C as described in legend 
to Figure 12. Treatment with GalNAca-benzyl produced a significant reduction in MUCl 
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expression in cells grown in GalNAc as well as in Gal and GalNAc (lanes 1 and 4). In cells 
grown in Gal and GalNAc (lanes 4-6) only weak expression of MUCl was detected, but 
treatment of cells with GalNAca-benzyl still produced a marked shift in migration to lower 
molecular weight migrating species 

5 

Figure 14 is an SDS-PAGE Western analysis showing the identification of a novel 
inhibitor, GalNAcP-benzyl, which exhibits the same effect on mucin transport as 
GalNAca-benzyl, but does not affect 0-glycan extension and O-glycosylation in general. 
CHO IdlD cells stably transfected with a full coding MUCl construct were grown for 36 

10 hours in the presence of 1 mM GalNAc (lanes 1-3 and 7-9) or 1 mM GalNAc and 0,1 mM 
Gal (lanes 4-6 and 10-12) to limit core O-glycosylation to GalNAcal-O-Ser/Thr and 
Gaipi-3GalNAcal-0-Ser/Thr, respectively. Cells were further treated with 2 mM 
GalNAca-benzyl (lanes 1, 4, 7, 10), 2 mM GalNAcP-benzyl (lanes 2, 5, 8, 1 1) or 2 mM 
GlcNAca-benzyl (lanes 3, 6, 9, 12). Cells were washed and lysed at 36 hours and the 

15 lysates subjected to immxmoprecipitation with monoclonal antibodies HMFG2 (lanes 1-6) 
or 5E5 (lanes 7-12). Immunoprecipitates were analysed by SDS-PAGE and westem blot 
using HMFG2 antibody. Lanes M and C as described in legend to Figure 7. Treatment with 
GalNAcP-benzyl produced the same or better reduction in MUCl expression as treatment 
with GalNAca-benzyl in cells grown in GalNAc as well as in Gal and GalNAc (lanes 2, 5, 

20 8). In cells grown in Gal and GalNAc (lanes 4-6) MUCl expression was reduced with 
GalNAcP-benzyl treatment (lane 5), but in contrast to cells treated with GalNAca-benzyl 
(lane 4), GalNAcP-benzyl produced no change in the migration of MUCl demonstrating 
that this inhibitor does not affect O-glycosylation. The lack of immunoprecipitation of 
MUCl by 5E5 in cells grown in Gal and GalNAc (lanes 10-12) indicates that MUCl is 

25 glycosylated with more complex structures than GalNAcal -O-Ser/Thr as recognized by 
this antibody. 

Figure 15 is an SDS-PAGE Westem analysis resulting from the same experiment 
as Figure 14, except that expression is visualized by the monoclonal antibody 5E5. Cells 
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were grown for 36 hours in the presence of 1 mM GalNAc (lanes 1-3 and 7-9) or 1 mM 
GalNAc and 0.1 mM Gal (lanes 4-6 and 10-12) to limit core 0-glycosylation to 
GaLNAcal-O-Ser/Thr and Gaipi-3GalNAcal-0-Ser/Thr, respectively. Cells were further 
treated with 2 mM GalNAca-benzyl (lanes 1, 4, 7, 10), 2 mM GaUSfAcP-benzyl (lanes 2, 5, 
5 8, 1 1) or 2 mM GlcNAca-benzyl (lanes 3, 6, 9, 12). Cells were washed and lysed at 36 
hours and the lysates subjected to immunoprecipitation with monoclonal antibodies 
HMFG2 (lanes 1-6) or 5E5 (lanes 7-12). Immunoprecipitates were analysed by SDS- 
PAGE and westem blot using 5E5 antibody. Lanes M and C as described in legend to 
Figure 12. Treatment with GalNAcp-benzyl produced the same or better reduction in 
10 MUC 1 expression as treatment with GalNAca-benzyl in cells grown in GalNAc (lanes 2 
and 8). The lack of immunostaining of MUCl by 5E5 in cells grown in Gal and GalNAc 
indicates that MUCl is glycosylated with more complex structures than GalNAcal-0- 
Ser/Thr as recognized by this antibody. 

15 Figures 16A-0 (left to right from top to bottom) is a series of photomicrographs 

showing that the main 0-glycan phenotype of CHO IdlD cells grown in Gal and GalNAc 
is sialylated T and that the 5E5 antibody does not react with MUCl, with T or silaylated T 
glycoforms of MUC 1 . 

20 Figure 17 illustrates SDS-PAGE Westem analysis of MUC5AC secretion from 

HT29MTX cells using polyclonal anti-MUC5AC antibody LUM5-L In order to evaluate 
GalNAcjS-benzyl as an inhibitor of mucin glycosylation and secretion we used the HT-29 
metotrexate (MTX) cultured colon carcinoma cells of mucin secreting phenotype. HT-29 
MTX cells have a goblet cell Hke phenotype with constitutive production of MUC5AC. 

25 Cells were continuously grown up to 21 days in the presence of 5 mM GalNAca-benzyl, 5 
mM GalNAcP-benzyl, 5 mM GlcNAca-benzyl and without inhibitor. SDS-PAGE Westem 
blotting of media (10 jiL) using the polyclonal anti-MUC5AC antibody LUM5-1 detected 
MUC5AC in media from untreated cells (Lane 1) and cells treated with GlcNAca-benzyl 
(Lane 2). In striking contrast, MUC5 AC was not detected in the media from cells treated 
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with GalNAca-benzyl (Lane 3) or GalNAcjS-benzyl (Lane 4). Results shown are from 
media collected at 14 days. 

Figure 18 illustrates immuno-fluorescence staining of HT29MTX cells treated with 
5 inhibitors as indicated and stained with the anti-MUC5AC monoclonal antibody, CLH2. 
Treatment of cells with GalNAca-benzyl (Panel A) induced a storage phenotype with 
increased intracellular staining of MUC5AC. Cells treated with GalNAcjS-benzyl (Panel B) 
did not lead to a storage disease phenotype, and furthermore a clearly diminished 
intracellular staining of MUC5AC was found compared to cells treated with GlcNAca- 
10 benzyl (Panel C) and untreated cells (Panel D). 

DETAILED DESCRIPTION OF THE INVENTION 

All patent applications, patents, and literature references cited in this specification 
are hereby incorporated by reference in their entirety, hi the case of conflict, the present 
description, including definitions, is intended to control. 

15 1. Definitions 

The terms used in this specification generally have their ordinary meanings in the 
art, within ttie context of this invention and in the specific context where each term is used. 
Certain terms are discussed below, or elsewhere in the specification, to provide additional 
guidance to the practitioner in describing the compositions and methods of the invention 

20 and how to make and use them. 

As used herein, the term "about" or "approximately" means within 50% of a given 
value, preferably within 20%, more preferably within 10%, more preferably still within 
5%, and most preferably within 1% of a given value. Altematively, the term "about" or 
"approximately" means that a value can fall within a scientifically acceptable error range 

25 for that type of value, which will depend on how qualitative a measurement can be given 
the available tools. "About" or "approximately" may define a distribution around a mean 
value, rather than a single value. 
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Molecular Biology Definitions. In accordance with the present invention, there 
may be employed conventional molecular biology, microbiology and recombinant DNA 
techniques within the skill of the art. Such techniques are explained fully in the literature. 
See, for example, Sambrook, Fitsch & Maniatis, Molecular Cloning: A Laboratory 
5 Manual, Second Edition (1 989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York (referred to herein as "Sambrook et al., 1989"); DNA Cloning: A Practical 
Approach, Volumes I and n (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait 
ed. 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins, eds. 1984); Animal 
Cell Culture (R.I. Freshney, ed. 1986); Immobilized Cells and Enzymes (IRL Press, 1986); 
10 B.E. Perbal, A Practical Guide to Molecular Cloning (1984); F.M. Ausubel et al. (eds.). 
Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 

"Nucleic acid" or "polynucleotide" as used herein refers to purine- and pyrimidine- 
containing polymers of any length, either polyribonucleotides or polydeoxyribonucleotides 
or mixed polyribo-polydeoxyribo nucleotides. This includes suagle- and double-stranded 
15 molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA hybrids, as well as "protein 

nucleic acids" (PNA) formed by conjugating bases to an amino acid backbone. This also 
includes nucleic acids containing modified bases (see below). 

"Complementary DNA or cDNA" as used herein refers to a DNA molecule or 
sequence that has been enzymatically synthesised fi-om the sequences present in an mRNA 
20 template, or a clone of such a DNA molecule. A "DNA Construct" is a DNA molecule or a 
clone of such a molecule, either single- or double-stranded, which has been modified to 
contain segments of DNA that are combined and juxtaposed in a manner that would not 
otherwise exist in nature. By way of non-limiting example, a cDNA or DNA which has no 
introns are inserted adjacent to, or within, exogenous DNA sequences. 
25 A plasmid or, more generally, a vector, is a DNA construct containing genetic 

information that may provide for its replication when inserted into a host cell. A plasmid 
generally contains at least one gene sequence to be expressed in the host cell, as well as 
sequences that facilitate such gene expression, including promoters and transcription 
initiation sites. It may be a linear or closed circular molecule. 

{W:\04305\100H154-USl\000g5062.DOC IDIilliniiniilllilDiDlllilllDIDD } 

19 



Nucleic acids are "hybridizable" to each other when at least one strand of one 
nucleic acid can anneal to another nucleic acid under defined stringency conditions. 
Stringency of hybridization is determined, e.g., by a) the temperature at which 
hybridization and/or washing is performed, and b) the ionic strength and polarity (e.g., 
5 formamide) of the hybridization and washing solutions, as well as other parameters. 
Hybridization requires that the two nucleic acids contain substantially complementary 
sequences; depending on the stringency of hybridization, however, mismatches maybe 
tolerated. Typically, hybridization of two sequences at high stringency (such as, for 
example, in an aqueous solution of 0.5X SSC, at 65°C) requires that the sequences exhibit 

10 some high degree of complementarity over their entire sequence. Conditions of 

intermediate stringency (such as, for example, an aqueous solution of 2X SSC at 65°C) and 
low stringency (such as, for example, an aqueous solution of 2X SSC at 55**C), require 
correspondingly less overall complementarily between the hybridising sequences. 
Hybridization stringency has been defined in numerous publication known to the skilled in 

15 the art (Meinkoth and Wahl, Anal.Biochem. 138,267-284, 1984; Maniatis et al., Molecular 
cloning: a laboratory manual. Cold Spring Harbor Laboratory Press, 1989; J.Q.Zhang, 
Eur.J.Biochem. 239,835-841:1996; M.Friedman-Einat, General and Comparative 
Endocrinology 115,354-363:1999: M.Szabo, J.Bacteriology, 1544-1553:1995; S.Sau, 
J.Bacteriology, 21 182126, 1996). Nucleic acids are 'Tiybridizable" to each other when at 

20 least one strand can anneal to another nucleic acid under defined stringency conditions. 
High stringency hybridization is defined as 42°C over night hybridization under standard 
conditions (Maniatis et al., Molecular cloning: a laboratory manual. Cold Spring Harbor 
Laboratory Press, 1989), followed by 5 washes with 2XSSC, 0.1%SDS at 42X, once with 
0.5XSSC, 0.1%SDS at 55°C and once with O.IXSSC, 0.1%SDS at 55X. (IXSSC is 

25 0. 1 5M NaCl, 0.0 1 5M Na citrate). Northern and Southern nucleic acid blotting 

hybridization techniques, especially for the purpose of investigating hybridization 
specificity, is well known to those skilled in the field of the invention. 

An "isolated" nucleic acid or polypeptide as used herein refers to a component that 
is removed fi-om its original environment (for example, its natural environment if it is 
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naturally occurring). An isolated nucleic acid or polypeptide contains less than about 50%, 
preferably less than about 75%, and most preferably less than about 90%, of the cellular 
components with which it was originally associated. 

A "probe" refers to a nucleic acid that forms a hybrid structure with a sequence in a 
5 target region due to complementarily of at least one sequence in the probe with a sequence 
in the target region. 

A nucleic acid that is "derived from" a designated sequence refers to a nucleic acid 
sequence that corresponds to a region of the designated sequence. This encompasses 
sequences that are homologous or complementary to the sequence, as well as "sequence- 

10 conservative variants" and "function-conservative variants". Sequence-conservative 

variants are those in which a change of one or more nucleotides in a given codon position 
results in no alteration in the amino acid encoded at that position. 

Function-conservative variants of polypeptide GalNAc-transferases are those in 
which a given amino acid residue in the polypeptide has been changed without altering the 

15 overall conformation and enzymatic activity (including substrate specificity) of the native 
polypeptide; these changes include, but are not limited to, replacement of an amino acid 
with one having similar physico-chemical properties.This includes but is not limited to, 
replacement of an amino acid with one having similar structural or physical properties, 
including polar or non-polar character, size, shape and charge (see, eg.. Table A). 

20 A "polypeptide" is a chain of chemical building blocks called amino acids that are 

linked together by chemical bonds called "peptide bonds". The term "protein" refers to 
polypeptides that contain the amino acid residues encoded by a gene or by a nucleic acid 
molecule (e.g., an mRNA or a cDNA) transcribed from that gene either directly or 
indirectly. Optionally, a protein may lack certain amino acid residues that are encoded by a 

25 gene or by an mRNA. For example, a gene or mRNA molecule may encode a sequence of 
amino acid residues on the N-terminus of a protein (i.e., a signal sequence) that is cleaved 
from, and therefore may not be part of, the final protein. A protein or polypeptide, 
including an enzyme, may be a "native" or "wild-type", meaning that it occurs in nature; or 
it may be a "mutant", "variant" or "modified", meaning that it has been made, altered, 
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derived, or is in some way different or changed from a native protein or from another 
mutant. 

A "mutation" means any process or mechanism resulting in a mutant protein, 
enzyme, polypeptide, polynucleotide, gene, or cell. This includes any mutation in which a 
5 protein, enzyme, polynucleotide, or gene sequence is altered, and any detectable change in 
a cell arising from such a mutation. The altered protein, enzyme, polypeptide or 
polynucleotide is a "mutant", also called a "variant." Typically, a mutation occurs in a 
polynucleotide or gene sequence, by point mutations (substitutions), deletions, or insertions 
of single or multiple nucleotide residues. A mutation includes polynucleotide alterations 
10 arising within a protein-encoding region of a gene as well as alterations in regions outside 
of a protein-encoding sequence, such as, but not limited to, regulatory or promoter 
sequences. A mutation in a gene can be "silent", Le., not reflected in an amino acid 
alteration upon expression, leading to a "sequence-conservative" variant of the gene. This 
generally arises when one amino acid corresponds to more than one codon. Table A 
15 outlines which amino acids correspond to which codon(s). 

Thus, due to the degeneracy of the genetic code, any three-nucleotide codon that 
encodes the GalNAc-transferase lectin domain polypeptides described herein is within the 
scope of the invention. 

The terms "mutant" and "variant" may also be used to indicate a modified or altered 
20 gene, DNA or RNA sequence, enzyme, cell, etc., Le., any kind of mutant. Such changes 
also include changes in the promoter, ribosome binding site, etc. 

As outlined above, amino acid substitutions are generally based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of 
25 the foregoing characteristics into consideration are well known to those of skill in the art 
and include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine 
and asparagine; and valine, leucine and isoleucine. 

In addition, modifications, which do not normally alter the primary sequence of the 
GalNAc-transferase lectin domain polypeptides, include in vivo or in vitro chemical 
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derivatization of polypeptides, e.g., acetylation, methylation, or carboxylation. Also 
included as variant polypeptides of this invention are these polypeptides modified by 
glycosylation, e.g., those made by modifying the glycosylation pattems of a polypeptide 
during its synthesis and processing or in further processing steps; or by exposing the 
5 polypeptide to enzymes which affect glycosylation, such as mammalian glycosylating or 
deglycosylating enzymes. Also embraced as variant polypeptides are the above-identified 
mutagenized sequences, which have phosphorylated amino acid residues, e.g., 
phosphotyrosine, phosphoserine, or phosphothreonine. 
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TABLE A 



Amino Acids, Corresponding Codons, and Functionality/Property 



Amino Acid 


SLC 


DNA codons 


Side Chain Property 


Isoleucine 


I 


ATT,ATC,ATA 


Hydrophobic 


Leucine 


L 


CTT, CTC, CTA, CTG, TTA, TTG 


Hydrophobic 


Valine 


V 


GTT, GTC, GTA, GTG 


Hydrophobic 


Phenylalanine 


F 


TTT,TTC . 


Aromatic side chain 


Methionine 


M 


ATG 


Sulphur group 


Cysteine 


C 


TGT, TGC 


Sulphur group 


Alanine 


A 


GCT, GCC, GCA, GCG 


Hydrophobic 


Glycine 


G 


GGT, GGC, GGA, GGG 


Hydrophobic 


Proline 


P 


CCT, CCC, CCA, CCG 


Secondary amine 


Threonine 


T 


ACT, ACC, ACA, ACG 


Aliphatic hydroxyl 


Serine 


S 


TCT, TCC, TCA, TCG, AGT, AGC 


Aliphatic hydroxyl 


Tyrosine 


T 


TAT,TAC 


Aromatic side chain 


Tryptophan 


W 


TGG 


Aromatic side chain 


Glutamine 


Q 


CAA, CAG 


Amide group 


Asparagine 


N 


AAT, AAC 


Amide group 


Histidine 


H 


CAT, CAC 


Basic side chain 


Glutamic acid 


E 


GAA, GAG 


Acidic side chain 


Aspartic Acid 


D 


GAT, GAC 


Acidic side chain 


Lysine 


K 


AAA, AAG 


Basic side chain 


Arginine 


R 


CGT, CGC, CGA, CGG, AGA, AGG 


Basic side chain 


Stop codons 


Stop 


TAA, TAG, TGA 





As referred to herein, "sequence similarity" means the extent to which nucleotide or 
protein sequences are related. The extent of similarity between two sequences can be based 
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on percent sequence identity and/or conservation. Amino acids other than those indicated 
as conserved may differ in a protein or enzyme so that the percent protein or amino acid 
sequence similarity between any two proteins of similar function may vary and can be, for 
example, at least 70%, preferably 75%, more preferably 80%, even more preferably 85%, 
5 and most preferably at least 90%, as determined according to an alignment scheme. 

"Sequence identity" herein means the extent to which two nucleotide or amino acid 
sequences are invariant. 

"Sequence alignment" means the process of lining up two or more sequences to 
achieve maximal levels of sequence identity (and, in the case of amino acid sequences, 
10 conservation), e.g., for the purpose of assessing the degree of sequence similarity. 

Numerous methods for aligning sequences and assessing similarity and/or identity are 
known in the art such as, for example, the ClustalW method, the Cluster Method, wherein 
similarity is based on the MEGALIGN algorithm, as well as BLASTN, BLASTP, and 
FASTA (Lipman and Pearson, 1985; Pearson and Lipman, 1988). When using all of these 
15 programs, the preferred settings are those that result in the highest sequence similarity. 

The term "host cell" means any cell of any organism that is selected, modified, 
transformed, grown, or used or manipulated in any way, for the production of a substance 
by the cell, for example the expression by the cell of a gene, a DNA or RNA sequence, a 
protein or an enzyme. 

20 A "donor substrate" is a molecule recognised by, e.g., a polypeptide GalNAc- 

transferees and that contributes a iV-acetylgalactosamine moiety for the transferase reaction. 
For polypeptide GalNAc-transferases, a donor substrate is UDP-N-acetylgalactosamine or 
with some GalNAc-transferase isoforms UDP-galactose. An "acceptor substrate" is a 
molecule, preferably a peptide, protein, glycopeptide, and glycoprotein, that is recognised 

25 by, e.g., a polypeptide GalNAc-transferase and that is the target for the modification 
catalysed by the transferase, i.e., receives the carbohydrate moiety. For polypeptide 
GalNAc-transferases, acceptor substrates include without limitation peptides, proteins, 
glycopeptides, and glycoproteins. 
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The term "agonist" refers to a molecule that increases the amount of, or prolongs 
the duration of, the activity of the polypeptide. The term "enhancer" refers to a molecule 
that similarly increases the amount of, or prolongs the duration of, the activity of the 
polypeptide. The term "antagonist" refers to a molecule, which decreases the biological or 
5 immunological activity of the polypeptide. The term "inhibitor** similarly refers to a 
molecule, which decreases the biological or immunological activity of the polypeptide. 
Agonists, antagonists, and inhibitors may include proteins, nucleic acids, carbohydrates, or 
any other molecules that associate with a polypeptide GalNAc-transferase. 

The term "agent" includes small molecules, peptide mimetics and polypeptides. 

10 "Mimetics" of GalNAc-transferase lectin-domain inhibitors are molecules that 

functionally mimic the stmcture or function of a GalNAc-transferase lectin-domain 
inhibitor. Molecular mimetics include, but are not limited to: small organic compounds; 
nucleic acids and nucleic acid derivatives; saccharides or oligosaccharides; peptide 
mimetics including peptides, proteins, and derivatives thereof, such as peptides containing 

15 non-peptide organic moieties, synthetic peptides which may or may not contain amino 
acids and/or peptide bonds, but retain the structural and functional features of a peptide 
ligand; pyrrolidines; peptoids and oligopeptoids which are molecules comprising N- 
substituted glycine, such as those described by Simon et al. (1992) Proc. Natl. Acad. Sci. 
USA 89:9367. 

20 The human N-acetylgalactosaminyltransferase Tl gene (GALNTl) has been 

described previously The sequence of the GALNTl mRNA and the sequence of the 
GalNAc-Tl polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers X85018 and CAA59380, respectively. 

The hiunan N-acetylgalactosaminyltransferase T2 gene (GALNT2) has been 

25 described previously The sequence of the GALNT2 mRNA and the sequence of the 
GalNAc-T2 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers X85019 and CAA59381, respectively. 

The human N-acetylgalactosaminyltransferase T3 gene (GALNT3) has been 
described previously The sequence of the GALNT3 mRNA and the sequence of the 
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GalNAc-T3 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers X92689 and CAA63371, respectively. 

The human N-acetyigalactosaminyltransferase T4 gene (GALNT4) has been 
described previously The sequence of the GALNT4 mRNA and the sequence of the 
5 GalNAc-T4 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers Y08564 and CAA69875, respectively. 

The human N-acetylgalactosaminyltransferase T5 gene (GALNT5) has been 
described previously . The sequence of the GALNT5 niRNA and the sequence of the 
GalNAc-T5 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
10 accession numbers AJ245539 and CAB65 1 04, respectively. 

The human N-acetylgalactosaminyltransferase T6 gene (GALNT6) has been 
described previously The sequence of the GALNT6 mRNA and the sequence of the 
GalNAc-T6 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers AJ133523 and CAB55325, respectively.The human N- 
15 acetylgalactosaminyltransferase T7 gene (GALNT7) has been described previously ^. The 
sequence of the GALNT7 mRNA and the sequence of the GalNAc-T7 polypeptide have 
been submitted to GenBank/EBI Data Bank and assigned accession numbers AJ002744 
and CAB60270, respectively. 

The human N-acetylgalactosaminyltransferase T8 gene {GALNT8) has been 
20 described previously The sequence of the GALNT8 mRNA and the sequence of the 
GalNAc-T8 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers AJ271385 and CAB89199, respectively. 

The human N-acetylgalactosaminyltransferase T9 gene {GALNT9) has been 
described previously The sequence of the GALNT9 mRNA and the sequence of the 
25 GalNAc-T8 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers AB040672 and BAB 13699, respectively. 

The human N-acetylgalactosaminyltransferase TIO nucleic acid sequence 
(GALNTIO) and the sequence of the encoded GalNAc-TlO polypeptide have been 
submitted to GenBank/EBI Data Bank. The nucleic acid accession number is AJ505950. 
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The human N-acetylgalactosaminyltransferase Til gene (GALNTll) has been 
described previously ^. The sequence of the GALNTll mRNA and the sequence of the 
GalNAc-Tl 1 polypeptide have been submitted to GenBank/EBI Data Bank and assigned 
accession numbers Y12434 and CAC79625, respectively. 
5 The human N-acetylgalactosaminyltransferase T12 nucleic acid sequence 

(GALNT12) and the sequence of the GalNAc-T12 polypeptide have been submitted to 
GenBank/EBI Data Bank. The nucleic acid accession number is All 32365. This sequence 
is disclosed herewith. 

The human N-acetylgalactosaminyltransferase T13 nucleic acid sequence 
10 (GALNT13) and the sequence of the GaINAc-T13 polypeptide have been submitted to 
GenBank/EBI Data Bank. The nucleic acid accession number is ARl 53422. 

The references cited above for describing human GalNAc-Tl-13 are incorporated 
herem by reference in their entirety and to the same extent as if each reference was 
individually incorporated by reference. 
15 Expression to produce enzymatically-active polypeptide GalNAc-transferases can 

be carried out in any number of conventional expression systems familiar to those skilled in 
the art. In one embodiment, GalNAc-transferases are expressed in a secreted soluble form, 
which can be recovered from the culture medium. Such secreted enzymes lack the N- 
terminal cytoplasmic tail and transmembrane retention sequence, and have N-terminal 
20 sequence starting in the predicted stem region (depicted domain stmctures of polypeptide 
GalNAc-transferases shown in Figure 1). The boundaries of the stem is N-terminally 
defined by the hydrophobic signal sequence, while the C-terminal boundary is less clearly 
defined but limited to the conserved catalytic unit of the enzymes as defined by multiple 
sequence ahgnments. For some isoforms including GarNAc-T2 the N-terminal sequence 
25 have been determined in naturally occurring soluble proteins derived from proteolytic 
cleavage . In another embodiment, host cells (e.g. CHO cells) are engineered to express 
fiill coding GalNAc-transferases and glycosylate substrates in vivo in host cells. 

Expression to produce functional lectin domains of polypeptide GalNAc- 
transferases without the catalytic unit (or activity) can be carried out in any number of 
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conventional expression systems familiar to those skilled in the art. In one embodiment, 
GalNAc-transferase lectins are expressed in a secreted soluble form, which can be 
recovered from the culture medium. Such secreted soluble forms lack the N-terminal 
cytoplasmic tail, transmembrane retention sequence, stem region and the catalytic unit. The 
5 boundaries of the catalytic units and lectin domains are defined by multiple sequence 
alignments and experimentation of lectin binding activity (multiple sequence alignment 
analysis of the C-terminal sequences polypeptide GalNAc-transferases including the most 
C-terminal boundaries of the catalytic domains and the entire lectin domains shovra in 
Figure 2). The boundaries cannot be clearly defined but the most C-terminal well- 

10 conserved sequence motif of the catalytic units (WYLENVYP) can be excluded from the 
lectin domains. Parts of or the entire catalytic domains may be included to produce 
fimctional lectin domains, and inclusion of inactivating mutations in the catalytic units (e.g. 
mutations in the DxH motif important for donor substrate binding, or residues important for 
acceptor substrate binding ^) may be used to avoid additional binding activity mediated 

15 through the catalytic units. Li another embodiment, host cells (e.g. CHO cells) are 
engineered to express full coding polypeptide GalNAc-transferases with or without 
mutations in their catalytic units and binding mediated through lectin domains are 
detremiend in vivo in host cells. 

Cells stably or transiently transfected with full coding or secreted expression 

20 constructs of mucins, mucin-like glycoproteins, 0-glycosylated proteins, or proteins can be 
carried out by any number of conventional methods familiar to those skilled in the art. In 
one embodiment, the mucin MUCl is expressed in a soluble form, which can be recovered 
from the culture medium (Figure 3 illustrates MUCl expression constructs used in this 
invention; the DNA sequence is available from GenBank accession number M61 170). In 

25 another embodiment, host cells (e.g. CHO or CHO IdlD cells) are engineered to express 
MUCl on the cell surface. In a preferred embodiment of the invention the cells are 
mammalian and more preferably, the cells are human. 

Human cell lines expressing cell surface mucins or secreting mucins can be 
selected, cultured and treated by any number of conventional methods familiar to those 

{W:\04305\l 00H154-USl\00085062.DOC liiDDinilllElllieigiffigiDn } 

29 



skilled in the art. In one embodiment, mucins are expressed in a secreted soluble form 
without transmembrane retention sequence, which can be recovered from the culture 
medium. In another embodiment, host cells (e.g. CHO IdLD cells) are engineered to express 
fiill coding mucins on the cell surface.2. 

5 2. General Aspects of the Invention 

The putative lectin domains of some GalNAc-transferases notably GalNAc-T4 and 
-T7 have previously been shown to be important for the GalNAc-glycopeptide substrate 
specificities exhibited by these GalNAc-transferase isoforms (PCT WO 01/82215). The 
mechanism by which the putative lectin domains exert this effect on the enzyme activities 

10 is unknown. However, because GalNAc and GalNAca-benzyl were found to selectively 
inhibit these activities it was hypothesized that the putative lectin domains functioned by 
recognizing the sugar or glycopeptide in a lectin-like interaction^. Considerable efforts have 
been apphed to demonstrate actual binding without success in the past (Bennett et al 
unpublished, personal communications). 

15 In the present invention a direct binding assay was developed using secreted soluble 

GalNAc-T4 and -T2, and chemoenzymatically produced multimeric MUCl tandem repeat 
GalNAc-glycopeptides (Figure 4). Short MUCl glycopeptides of traditional length of 15- 
20 amino acids have failed to provide significant binding in the same assay system, and one 
improvement leading to the success was the application of extended multimeric MUCl 

20 GalNAc-glycopeptides. Binding was also found to an enzymatically GaUsfAc-glycosylated 
fiision protein expressed in E. coli and contaming 30 amino acids of the MUC2 tandem 
repeat. Another improvement was the use of biotinylation of the enzymes, which provided 
an improved signal compared to previous attempts with identifying retained or bound 
enzyme by measuring activity. The specific activities of GalNAc-transferases as measured 

25 in in vitro assays are relatively low, and in past attempts to use binding and elution of 
enzyme acitivity presumably the detection level was not sufficient to detect binding. The 
developed assay was validated to demonstrate binding through the lectin domains by 
several ways: i) binding was selective for GalNAc-glycosylated glycopeptides with no 
significant binding to unglycosylated peptides; ii) a single amino acid substitution in the 
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lectin domains of GalNAc-T4 (and ~T2), known to selectively destroy GalNAc- 
glycopeptide specificity of these enzymes without affecting the catalytic unit ^ abolished 
binding; iii) binding was not affected by EDTA treatment which is known to destroy 
catalytic activity of GalNAc-transferases ^^'^^'^^-^ iv) binding was selectively inhibited by the 
5 monosacharide GalNAc and not by other sugars. 

In order to minimize the size and functional complexity of GalNAc-transferase 
lectins to be used as probes for binding studies, we used multiple sequence alignment 
analysis to predict and design suitable expression constructs for isolated lectin domains 
(Figure 2). Li the present invention a direct binding assay was developed using isolated 

10 lectin domains of GalNAc-T4 and -T2 with minimal size (Figure 5). Analysis of the fine 
specificity of the binding by inhibition studies showed that GalNAc-T2 and -T4 lectins 
exhibit restricted specificity for GalNAc structures, and surprisingly that the anomeric 
configuration of the GalNAc residue is not important. Thus, both GalNAca- and 
GalNAcP-benzyl inhibited binding to the same degree. The lectin Helix Pomatia (HP) was 

15 used as a control plant lectin with known binding specificity for GalNAca-structures. HP 
lectin showed a very different highly preferred binding specificity for GalNAca-structures. 

Studies with GalNAca-benzyl have shown that this compound is effectively taken 
up by cells and used in the Golgi compartments It is also well known in the art that 
sugar-aryl compounds are taken up by the cell and used in the Golgi compartments. Thus, 

20 the surprising finding that GalNAc-transferase lectins can be inhibited by PGalNAc 
(GalNAcp-benzyl), provides a new tool to study the function of polypeptide GalNAc- 
transferases in vivo; GalNAcp-benzyl, because it, too, will enter the cell and be used in the 
Golgi compartments. 

AvailabiUty of a binding assay is a usefiil tool to identify and characterize inhibitors 

25 of GalNAc-transferase lectins. In accordance with the binding assay method of the 
invention, GalNAc-transferases are contacted with a glycopeptide, glycoprotein, 
fiisionprotein, or other appropriate structure or polymer containing the sugar hapten 
structure recognized by the GalNAc-transferase lectins, preferably iV-acetylgalactosamine, 
and the GalNAc-transferase protein or lectin bound is quantitatively measured. The 
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GalNAc-transferases may be in the form of a secreted soluble construct as applied in this 
invention, and any extended or truncated construct of a GalNAc-transferase as well as 
hybrid fusion protein that maintains the binding properties of the lectin. The ligand may be 
in the form of a chemoenzymatically produced MUCl GalNAc-glycopeptide as applied in 
5 this invention, and any glycopeptide, glycoprotein, fusionprotein of any size or sequence, 
or other appropriate structure or polymer containing the sugar hapten structure recognized 
by the GalNAc-transferase lectins. Synthesis and chemoenzymatic synthesis of 
glycopeptides are familiar to those in the art, and are described in the literature cited 
above and in the Examples provided below. The ligand sugar may be GalNAc, N- 

10 acetylgalactosamine, or any other sugar in any linkage and sequence recognized by a 
GalNAc-transferase lectin. The binding assay may be an enzyme-linked sohd phase 
immunoadsorption assay (ELIS A) as appUed in the invention, and any variant assay hereof 
where binding to ligand can be detected including without limitation radioimmunoassay 
(RIA), surface plasmon resonance (SPR), chemoluminescense, nuclear magnetic 

15 resonance spectroscopy (^H-NMR), and other methods know in the art. Binding may be 
detected by horse-radish-peroxidase HRP-Avidin biotin as appKed in this invention, and 
any other detection system including without limitation enzyme reactions, fluorescence, 
radioactivity, spectroscopy, spectrometry and other methods. The GalNAc-transferases 
may be labelled by biotinylation as appUed in this invention, and any other labelling 

20 including without limitation antibody tags, enzymes, fluorochromes, radioisotopes and 

other methods know in the art, as well as detected by antibodies, phage antibody fragments 
or other binding proteins. The assay may used to characterize binding specificities of 
GalNAc-transferase lectins, screen and identify inhibitors of GalNAc-transferase lectins, 
and screen and identify competitive binders such as different GalNAc-transferase lectins 

25 and other lectins and proteins with binding properties for carbohydrates. 

An in vivo model system for secretion of mucins was developed. A truncated 
secreted expression construct of the human cell surface mucin MUCl containing 32 
tandem repeats (Figure 3), was stably transfected into CHO wild type and CHO IdlD cells 
Figure 6 illustrates intracellur expression of soluble MUCl in wild type CHO 
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transfectant clone, wtCHO/MUClsol-clone-C4, visualized by multiple monoclonal anti- 
MUCl antibodies. Analysis of glycosylation was performed with a panel of antibodies with 
well-defined specificities for carbohydrate structures, and reactivity was mainly found with 
anti-T after pre-treatment with neuraminidase to remove sialic acids. Weak staining with 
5 anti-Tn was also found in some cells. Figure 7 illustrates western blot analysis of secreted 
MUCl firom the same cells. High molecular weight MUCl migrating with apparent mw 
higher than 300 Kd is labelled by HMFG2, SMS, and VU-2G7, while all antibodies 
including VU-4H5 label a low molecular weight MUCl migrating with apparent mw of 
130 Kd and presumed to represent virtually unglycosylated MUCl. Pretreatment with 

10 neuraminidase decreased migration of the high molecular weight MUC 1 bands, and anti-T 
antibody reactivity emerged. Stable MUCl transfectants in CHO IdlD showed similar 
patterns of reactivity when grown in Gal and GalNAc. 

An in vivo model system for cell surface expression of mucins was developed. A 
fiiU coding expression construct of the human cell surface mucin MUCl containing 32 

15 tandem repeats (Figure 3), was stably transfected into CHO wild type and CHO IdlD cells. 
CHO IdlD cells were originally established by Krieger et al and found to have a defect 
in UDP-Gal/GalNAc epimerase that renders the cells incapable of producing UDP-Gal and 
UDP-GalNAc. Lack of UDP-Gal limits the synthesis of all types of glycoconjugates 
including glycosphingolipids, N-linked and O-linked glycoproteins. The synthesis of O- 

20 Unked glycoproteins will be arrested at GalNAcal-O-Ser/Thr with or without addition of 
a2,6 linked sialic acid. In the absence UDP-GalNAc mainly O-linked mucin-type 
glycoconjugates are affected, and essentially no glycosylation occur, as the first sugar 
attached is GalNAc. The defect in CHO IdlD cells can be selectively restored by addition of 
1 mM GalNAc and or 0. 1 mM Gal to the growth medium ^, Addition of both sugars 

25 essentially restores normal glycosylation, whereas addition of GalNAc alone limits O- 

glycosylation to GalNAcal-O-Ser/Thr with or without addition of a2,6 linked siaHc acid, 
and also affects galactosylation of N-linked glycosylation and glycolipid biosynthesis. 
Altschuler et al have previously shown that cell surface expression of MUCl in CHO 
IdlD cells requires addition of GalNAc. 
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Cell surface expression of MUCl was established in stably transfected CHO 
wildtype and CHO IdlD cells. MUCl was detected at the surface of non-permeabilized 
cells using monoclonal anti-MUCl antibodies (Figure 8). In accordance with Alschuler et 
al MUCl surface expression in CHO IdlD cells was only found in cells grown in 
5 GalNAc or Gal and GalNAc, whereas cells grown without sugars or only in Gal failed to 
express MUCl at the surface. In agreement with the conclusion drawn by Altchuler et al 
surface expression of MUCl was dependent only on the first step in 0-glycosylation, the 
addition of GalNAc. 

MUCl produced in CHO IdlD cells grown without GalNAc is not accumulated in 

10 Golgi, but degraded in lysosomes This indicates that measuring total MUCl in cell 
lysates rather than exclusively at the cell surface may be used as a measure of MUCl 
expression. The experiments shown in Figures 9-10 use inununoprecipitation of total cell 
lysates with anti-MUCl antibody followed by westem blot analysis with the same or 
different anti-MUCl antibody to quantify and characterize MUCl expression in cells. 

15 MUCl produced in cell grown without GalNAc or only in the presence of Gal migrate 

close to the predicted mass of the protein core. With the addition of GalNAc to the medium 
high molecular weight forms of MUCl are found, and these react with all antibodies except 
VU-2G7. The antibody 5E5 only reacts with Tn glycoforms and lack of reactivity with 
MUCl fi-om cells grown in Gal and GalNAc indicate that the majority of MUCl produced 

20 is glycosylated with sialyl-T structures. 

GalNAca-benzyl is a well-known inhibitor of 0-glycosylation extension 
Treatment of cells with 1-2 mM GalNAca-benzyl partially blocks core 1 0-glycosylation 
including a2,3 sialylation. Treatment of cells with GalNAca-benzyl is also known to affect 
surface expression of mucins and 0-glycosylated glycoproteins, as well as in some cases 

25 secretion of mucins. A number of mammalian cell lines have been treated with 1-2 mM 
GalNAca-benzyl in the past and the resulting effects on 0-glycosylation as well as mucin 
secretion have varied with cell type (for a detailed review see ^^), 

The effect of GalNAca-benzyl on mucin transport and secretion has been 
concluded to be due to blockage of 0-glycosylation extension Figures 11-13 illustrate 
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that CHO IdlD cells grown in Gal and GalNAc (or wild type CHO cells), and treated with 2 
mM GalNAca-benzyl in agreement with this, exhibits reduced expression of MUCl as 
well as altered 0-glycosylation as judged by an altered SDS-PAGE migration pattern. 
Wildtype CHO cells as well as CHO IdlD cells grown in Gal and GalNAc produce O- 
5 glycans of the mono- and disialylated core 1 structures (NeuAca2-3Gaipi-3[NeuAca2-6] 
+/-GalNAca-0-Ser/Thr). Treatment with GalNAca-benzyl results in some exposure of 
unsialylated core 1 as evaluated by staining with anti-T monoclonal antibody HH8, 
whereas only very little Tn is exposed as evaluated with anti-Tn monoclonal antibody 5F4. 
The altered SDS-PAGE migration of MUCl produced in CHO IdlD cells grown in Gal and 

10 GalNAc (or wild type CHO cells, not shown) shown in Figures 12-13 (lanes 4 and 10 when 
indicated) reflects mainly loss of sialic acids. 

If the effect of GalNAca-benzyl treatment on mucin transport and secretion is due 
to inhibition of sialylation, then treatment of CHO IdlD cells grown only in GalNAc and 
hence producing only GalNAcal-O-Ser/Thr 0-glycosylation (neglible STn is produced as 

15 evidenced by lack of staining with anti-STn monoclonal antibodies 3F1 and TKH2, while 
cells stain very strongly with anti-Tn monoclonal antibodies 5F4 and 1E3), should have no 
effect on expression of MUCl in these cells. Surprisingly as shown in Figures 12-13 (lanes 
1 and 7 when indicated), GalNAca-benzyl treatment does inhibit MUCl expression in 
CHO IdlD cells with O-glycosylation controlled and limited to Tn glycoforms. This result 

20 shows for the first time that mucin transport and secretion may be directly affected by 
treatment with GalNAca-benzyl and not through a mechanism involving inhibtion of 
sialylation or the O-glycosylation extension pathways. Combined with the findings of 
Altchuler et al these results indicate that mucin transport and secretion requires some 
degree of GalNAc O-glycosylation, whereas 0-glycan extension including sialylation 

25 seems to be of less importance for this process. 

An appropriate control for GalNAca-benzyl treatment has not previously been 
studied. Selection of a benzyl monosaccharide that is not involved in and does not affect 
glycosylation pathways in cells is problematic. We chose to use GlcNAca-benzyl as such a 
control as this structure is not used in glycosylation pathways of mammaUan glycoproteins 
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and glycosphingolipids. As shown in Figures 12-13 (lanes 2,5, 8, and 11, when indicated) 
treatment of CHO IdlD cells with 2 mM GlcNAca-benzyl had no effect on MUCl 
expression and glycosylation. GlcNAca-benzyl thus serves as a control for treatment of 
cells with benzyl sugars, and this is important because benzyl sugars and their biosynthetic 
5 products appear to aggregate in cells and cause morphological changes with prolonged 
treatment 

Since transport of mucin in cells was selectively inhibited by GalNAca-benzyl (and 
not GlcNAca-benzyl), even in cells limited to GalNAcal-O-Ser/Thr 0-glycosylation, we 
hypothesized that polypeptide GalNAc-transferases and in particular their lectin domains 

10 could be involved in ensuring mucin transport and preventing direction to lysosomes. On 
the one hand, GlcNAca-benzyl does not in general inhibit GalNAc 0-glycosylation and 
polypeptide GalNAc-transferase enzyme activity. On the other, we had previously 
discovered that the GalNAc-glycopeptide acceptor substrate specificities of some GalNAc- 
transferase isoforms including GalNAc-T4 and -T7 ^(PCT WO 01/85215), which suggested 

15 that these enzymes could be involved. These precepts thus give rise to plural hypothesis. 
One hypothesis would suggest that inhibition of the GalNAc-glycopeptide acceptor 
substrate specificity of GalNAc-transferases leads to mucin glycoforms with lower density 
of 0-glycan occupancy (shown in vitro for e.g. MUCl tandem repeats ^), and that this 
decrease in 0-glycan density results in increased targeting to lysosomal degradation and 

20 hence decrease in expression. Another hypothesis would suggest that the lectin domains of 
GalNAc-transferases in general have the capacity to bind GalNAc and hence provide a 
lectin mediated chaperone-like fimction, which is required for Golgi transport of O- 
glycosylated proteins. Lectin chaperones are well known to function ER transport as well 
as in lysosomal targeting but the existence of such lectin chaperones for cell surface 

25 expression and secretion have not been demonstrated in the Golgi or trans-Golgi network. 

As described above we found in the present invention that GalNAc-transferase 
lectins in a binding assay to GalNAc-glycopeptides surprisingly showed similar inhibition 
with GalNAca-benzyl and GalNAcp-benzyl. This indicates that these lectins in contrast to 
many lectins including Helix Pomatia fail to distinguish the anomeric configuration of the 
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monosaccharide hapten recognized. pGalNAc is a rare linkage in mammalian 
glycoproteins and is found only in N-linked glycoproteins in man associated with the 
hormone specific glycosylation pattem where it generally is sulphated. Although, pGalNAc 
is found in both ganglioseries (GalNAcpl-4Gaipi-4Glcpl-Cer) and globoseries 

5 (GalNAcpi-3Galal-4Gaipi-4Glcpi-Cer) it is expected that treatment of cells with 2 mM 
GalNAcP-benzyl will not interfere significantly with glycosylation. This is based on the 
findings that the pl,3galactosyltransferases (p3GaH4 and p3Gal-T5, respectively) 
involved in extending pGalNAc in these two glycolipid structures show no or very poor 
activity with GalNAcp-benzyl The only pGalNAc containing structure in 0-linked 

10 glycosylation is found in the blood group related Sda structure (GalNAcpi-4(Neua2- 
3)Galpl-3GalNAcal-0-Ser/Thr) which has very restricted expression 

We therefore tested if GalNAcp-benzyl treatment of cells showed the same effects 
as GalNAca-benzyl treatment. As shown in Figures 14-15 (lane 5) GalNAcp-benzyl 
treatment does not interfere with 0-glycosylation m contrast to GalNAca-benzyl (lane 4), 

15 as no difference in SDS-PAGE migration is observed. However, GalNAca-benzyl, as well 
as GalNAcp-benzyl treatment of cells, produces similar significant reduction in expression 
of MUCl Figures 14-15. This shows surprisingly and for the first time that GaMAcp- 
benzyl represents a selective inhibitor of mucin transport and secretion. GalNAcp-benzyl is 
a novel preferred inhibitor of transport, surface expression and secretion of 0-glycosylated 

20 proteins and mucins, because it does not interfere with the 0-glycosylation extension 
process. GalNAcp-benzyl fiirther does not accumulate biosynthetic oligosaccharide 
products similar to those found with GalNAca-benzyl treatment (see Example 8) The 
finding that GaMAcP-benzyl exerts these effects on mucin expression combined with the 
finding that it inhibits polypeptide GalNAc-transferases strongly indicate that the 

25 mechanism by which GalNAca- and GalNAcP-benzyl inhibits mucin expression is through 
inhibition of GalNAc-transferase lectins. This supports the second hypothesis articulated 
above. Polypeptide GalNAc-transferase lectins thus represent prime targets for intervention 
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with mucin secretion and cell surface expression, and GalNAcp-benzyl represents a novel 
selective prototype inhibitor for such intervention. 

Preferred compounds for inhibition of GalNAc-transferase lectins are inactive as 
acceptor substrates for glycosyltransferases. In particular, the following glycosyltransferase 

5 activities: core 1 UDP-Gal:GalNAc-peptide pi,3galactosyltransferases, CMP- 
NeuAc:GalNAc-peptide a2,6sialyltransferases, and UDP-GlcNAc:pi,3N- 
acetylglucosaminyltransferases involved in 0-glycosylation, are inactive with the preferred 
inhibitory compounds. Examples of such inhibitory compounds are GalNAcal-O-benzyl 
with substitution of hydroxyl groups at C3 and/or C6 with methyl or acetyl groups to block 

10 acceptor sites. 

The methods described herein are designed to identify substances and compounds 
that bind to and or modulate the biological activity of a polypeptide GalNAc-transferase 
lectin, including substances that interfere with or enhance the activity of a polypeptide 
GalNAc-transferase lectin. 

15 GalNAc-transferase lectins may be used in the form of a truncated lectin domain as 

shown in Example 3, as a secreted GalNAc-transferase enzyme as shown in Example 2, or 
as a truncated protein or fusion protein with or without catalytic activity but with retained 
lectin domain and carbohydrate binding activity. 

Agents that modulate a polypeptide GalNAc-transferase lectin can be identified 

20 based on their ability to associate with such a lectin. Therefore, the invention also provides 
a method of identifying agents that associate with a polypeptide GalNAc-transferase lectin. 
Agents identified using the method of the invention may be isolated, cloned and sequenced 
using conventional techniques. An agent that associates with a polypeptide GalNAc- 
transferase lectin may be an agonist or antagonist of the biological or immunological 

25 activity of the lectin. 

Agents that can associate with a polypeptide GalNAc-transferase lectin may be 
identified by reacting such GalNAc-transferase lectin with a test substance, which 
potentially associates with a polypeptide GalNAc-transferase lectin under conditions which 
permit the association, and removing and/or detecting the associated GalNAc-transferase 
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lectin and substance. The substance-lectin complex, free substance, or non-complexed 
lectin may be assayed. Conditions, which peraiit the fomiation of substance-lectin 
complexes, may be selected having regard to factors such as the nature and amounts of the 
substance and the lectin. 
5 The substance-lectin complex, free substance or non-complexed lectin may be 

isolated by conventional isolation techniques, for example, salting out, chromatography, 
electrophoresis, gel filtration, fractionation, absorption, polyacrylamide gel electrophoresis, 
agglutination, or combinations thereof To facilitate the assay of the components, a labelled 
antibody against the transferase or the substance or a labelled lectin or a labelled substance 

10 may be utilized. The antibodies, lectins, or test substances may be labelled with a detectable 
substance as described above. 

A polypeptide GalNAc-transferase lectin, or a test substance used in the method of 
the invention may be insolubiUzed. For example, a lectin, or a test substance may be bound 
to a suitable carrier such as agarose, cellulose, dextran, Sephadex, Sepharose, 

15 carboxymethyl cellulose polystyrene, filter paper, ion-exchange resin, plastic fihn, plastic 
tube, glass beads, polyamine-methyl vinyl-ether-maleic acid copolymer, amino acid 
copolymer, ethylene-maleic acid copolymer, nylon, silk, etc. The carrier may be in the 
shape of, for example, a tube, test plate, beads, disc, sphere etc. The insolubiUzed lectin or 
substance may be prepared by reacting the material with a suitable insoluble carrier using 

20 known chemical or physical methods, for example, cyanogen bromide coupling. 

The invention also contemplates a method for evaluating an agent for its abihty to 
modulate the biological activity of a polypeptide GalNAc-transferase lectin by assaying for 
an agonist or antagonist (i.e. enhancer or inhibitor) of the association of the lectin with a 
substance that interacts with the polypeptide (e.g. carbohydrate binding site or parts 

25 thereof). The basic method for evaluating whether an agent is an agonist or antagonist of 
the association of a polypeptide GalNAc-transferase lectin and a substance that associates 
with the lectin is to prepare a reaction mixture containing the lectin and the substance under 
conditions which permit the formation of substance-lectin complexes, in the presence of a 
test agent. The test agent may be initially added to the mixture, or may be added subsequent 
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to the addition of the lectin and substance. Control reaction mixtures without the test agent 

or with a placebo are also prepared. The formation of complexes is detected and the 

formation of complexes in the control reaction, but not in the reaction mixture, indicates 

that the test agent interferes with the interaction of the lectin and substance. The reactions 
5 may be carried out in the liquid phase or the lectin, substance, or test agent may be 

inmiobilized as described herein. 

It will be understood that the agonists and antagonists, i.e. enhancers and inhibitors, 

that can be assayed using the methods of the invention may act on one or more of the 

interaction sites on the lectin or substance including agonist binding sites, competitive 
10 antagonist binding sites, non-competitive antagonist binding sites or allosteric sites. It will 

also be imderstood that competitive assays, in addition to direct assays, can be used to 

screen for and identify the agents of the present invention. 

The invention also makes it possible to screen for antagonists that inhibit the effects 

of an agonist of the interaction of a polypeptide GalNAc-transferase lectin with a substance 
15 capable of associating with the lectin. Thus, the invention may be used to assay for an agent 

that competes for the same interacting site of a polypeptide GalNAc-transferase lectin. 
Test compounds are screened from, for example, large libraries of synthetic or 

natural compounds. Numerous means are currently used for random and directed synthesis 

of saccharide, peptide, and nucleic acid based compounds. Examples of available libraries 
20 are synthetic compound libraries are commercially available from Maybridge Chemical 

Co. (Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates 

(Merrimack, NH), and Microsource (New Milford, CT). 

Agents which are effective in modulating a polypeptide GalNAc-transferase lectin 

can be identified, based on their ability to interfere with or enhance the lectin mediated 
25 binding capacity of the GalNAc-transferase protein or fragment hereof containing the lectin 

region. Therefore, the invention provides a method for evaluating a test substance for its 

ability to modulate the binding capacity of a polypeptide GalNAc-transferase lectin 

comprising 
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(a) reacting a binding substrate with a GalNAc-transferase or lectin polypeptide or 
fragment hereof in the presence of a test substance; 

(b) measuring the amoimt of binding substrate bound to the GalNAc-transferase 
polypeptide or fragment hereof, and 

5 (c) carrying out steps (a) and (b) in the absence of the test substance to determine if 
the substance interferes with or enhances the binding by the polypeptide 
GalNAc-transferase. 

Suitable binding substrates for use in the methods of the invention are polypeptides, 
glycopolypeptides, or glycoproteins, which are either synthetic or naturally occurring 

10 structures. The GalNAc-transferase lectin polypeptide may be obtained from natural 
sources or produced using recombinant methods as described and referenced herein. 

The binding or modifying substrates may be labelled with a detectable substance as 
described herein, and the interaction of the polypeptide of the invention with the binding or 
modifying substrates will give rise to a detectable change. The detectable change may be 

15 colorimetric, photometric, radiometric, potentiometric, etc. The GalNAc-transferase lectin 
polypeptide is reacted with the binding or modifying substrates at a pH and temperature 
effective for the polypeptide to bind the substrates, and where preferably one of the 
components is labeled, to produce a detectable change. It is preferred to use a buffer with 
the substrates to maintain the pH within the pH range effective for the polypeptides. The 

20 buffer and substrates may be used as an assay composition. Other compounds such as 
EDTA and detergents may be added to the assay composition. 

The reagents suitable for applying the methods of the invention to evaluate agents 
that modulate a polypeptide GalNAc-transferase lectin may be packaged into convenient 
kits providing the necessary materials packaged into suitable containers. The kits may also 

25 include suitable supports useful in performing the methods of the invention. 

Agents that modulate polypeptide GalNAc-transferase lectin(s) can also be 
identified by treating immortalized cells which express the transferase(s) with a test 
substance, and comparing the intracellular transport, degradation, surface expression, or 
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secretion of 0-glycosylated proteins, mucins, and glycoproteins performed of the cells with 
those of the cells in the absence of the test substance and/or with immortalized cells which 
do not express the transferase(s). Examples of immortalized cells that can be used include 
human cell lines, Chinese hamster ovary (CHO) cells and mutant cells CHO IdlD which 
5 express polypeptide GalNAc-transferase(s) or lectin(s) and produce cell membrane bound 
or secereted forms of the human mucin MUCl . In the absence of an inhibitor the cells will 
produce and transport MUCl to the cell surface or secrete MUCl into the growth medium. 
Substances that reduce the cell surface expression or the quantity of MUCl in the medium 
may be considered an inhibitor. 

10 The agents identified by the methods described herein, may be used for modulating 

the binding activity of a polypeptide GalNAc-transferase lectin, and they may be used as 
prototype drugs in the treatment of conditions mediated by a polypeptide GalNAc- 
transferase lectin and in designing further substances effective to treat such conditions. In 
particular, they may be used to alter density of 0-glycosylation on glycoproteins and 

15 mucins produced by cells, the intracellular transport and surface expression of 

glycoproteins and mucins, the secretion of glycoproteins and mucins, and other functions 
governed by the polypeptide GalNAc-transferases and their lectins in transport and 
secretion of glycoproteins and mucins. 

Therefore, the present invention has potential application in the treatment of various 

20 disorders associated with aberrant 0-glycosylation and/or mucin production in mammals, 
preferably himians. Such disorders include the following: tumors and cancers, lungs 
diseases associated with mucous accumulation such as asthma, chronic bronchitis, 
smoker's lung, cystic fibrosis, diseases of exocrine glands associated with increased or 
decreased mucin secretion such as Sj0grens syndrome, dry mouth etc. Other disorders 

25 include dysregulation of selectin-mediated leukocyte trafficking and would include but not 
be limited to disorders involving autoimmunity, arthritis, leukaemia's, lymphomas, 
immunosuppression, sepsis, wound healing, acute and chronic in action, cell mediated 
immunity, and the like. 
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In one embodiment, the invention provides inhibitors of the secretion of mucin or 
0-glycosylation products, which inhibitors do not induce intracellular accumulation of 
vesicles containing the product, as observed in, e,g., storage diseases or conditions 
associated with defects in lysosomal degradation. Screening methods to identify such 
5 inhibitors are also provided. While GalNAcoj-benzyl is capable of inhibiting O- 
glycosylation and mucin secretion, it is also associated with intracellular build-up of 
vesicles with the product^' . See FIG. 1 8. The invention further provides agents capable 
of preventing or treating disorders associated with aberrant mucin production and/or 
secretion in mammals, preferably humans, which agents inhibit secretion of one or more 

10 mucins without the disadvantage of extensive intracellular accumulation of vesicles 

containing the mucin. Exemplary disorders are Usted above. This embodiment is based, in 
part, on the observation that GalNAcB-benzyl, in contrast to GalNAca-benzyl, does not 
accumulate biosynthetic oligosaccharide products intracellularly and, as described in 
Example 8, blocks MUC5AC secretion without leading to intracellular vesicle 

15 accumulation or storage disease phenotype in HT-29 cells. Thus, according to the present 
invention, GalNAcB-benzyl, as well as variants and derivatives thereof that also possess 
this functional characteristic, can be used to inhibit secretion of MUC5AC or other mucins 
without leading to intravesicular accumulation of the mucin in any mucin-producing and/or 
-secreting cell (in particular a mammalian, preferably human, cell) treated in vivo or in vitro 

20 with the GalNAcB-benzyl compound or derivatives thereof 

The agents identified by the methods described herein, have potential application in 
treatment of tumors including inhibition of tumor metastasis and growth and/or regression 
of same. Tumor metastasis may be inhibited by inhibiting the adhesion of circulating 
cancer cells. The agents of the invention have particular potential application may be 

25 especially useful in the treatment of various forms of neoplasia such as leukaemias, 

lymphomas, melanomas, adenomas, sarcomas, and carcinomas of solid tissues in patients. 
In particular the composition may be used for treating malignant melanoma, pancreatic 
cancer, cervico-uterine cancer, cancer of the liver, kidney, stomach, lung, rectum, breast, 
bowel, gastric, thyroid, neck, cervix, salivary gland, bile duct, pelvis, mediastinum, urethra, 
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bronchogenic, bladder, esophagus and colon, and Kaposi's Sarcoma which is a form of 
cancer associated with HIV-infected patients with Acquired Immune Deficiency Syndrome 
(AIDS). The substances etc. are particularly useful in the prevention and treatment of 
tumors of lining mucosa and glands and the metastases derived firom these tumors. 
5 Accordingly, the various agents may be formulated into pharmaceutical 

compositions for administration to subjects in a biologically compatible form suitable for 
administration in vivo. By biologically compatible form suitable for administration in vivo 
is meant a form of the agent to be administered in which any toxic effects are outweighed 
by the therapeutic effects. The agents may be administered to Uving organisms including 

10 humans, and animals. Administration of a therapeutically active amount of the 

pharmaceutical compositions of the present invention is defined as an amount effective, at 
dosages and for periods of time necessary to achieve the desired result. For example, a 
therapeutically active amount of an agent may vary according to factors such as the disease 
state, age, sex, and weight of the individual, and the ability of pharmaceutical composition 

15 or polypeptide to elicit a desired response in the individual. Dosage regima may be adjusted 
to provide the optimum therapeutic response. For example, several divided doses may be 
administered daily or the dose may be proportionally reduced as indicated by the 
exigencies of the therapeutic situation. 

The active agent may be administered in a convenient manner such as by injection 

20 (subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, 
or rectal administration. Depending on the route of administration, the active agent may be 
coated in a material to protect the agent fi-om the action of enzymes, acids and other natural 
conditions that may inactivate it. 

The compositions described herein can be prepared by methods known per se for 

25 the preparation of pharmaceutically acceptable compositions which can be administered to 
subjects, such that an effective quantity of the active agent is combined in a mixture with a 
pharmaceutically acceptable vehicle. Suitable vehicles are described, for example, in 
Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences, Mack 
Publishing Company, Easton, Pa., USA 1985). On this basis, the compositions include, 
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albeit not exclusively, solutions of the agents in association with one or more 
pharmaceutically acceptable vehicles or diluents, and contained in buffered solutions with a 
suitable pH and iso-osmotic with the physiological fluids. 

The phrase "pharmaceutically acceptable" refers to molecular entities and 
5 compositions that are physiologically tolerable and do not typically produce an allergic or 
similar untoward reaction (for example, gastric upset, dizziness and the like) when 
administered to an individual. Preferably, and particularly where a immunogenic 
composition is used in humans, the term "pharmaceutically acceptable" denotes approved 
by a regulatory agency (for example, the U.S. Food and Drug Agency) or listed in a 
10 generally recognized pharmacopeia for use in animals (for example, the U.S. 
Pharmacopeia). 

Toxicity and therapeutic efficacy of compounds can be determined by standard 
pharmaceutical procedures, for example in cell culture assays or using experimental 
animals to determine the LD50 and the ED50. The parameters LD50 and ED50 are well 

15 known in the art, and refer to the doses of a compound that are lethal to 50% of a 

population and therapeutically effective in 50% of a population, respectively. The dose 
ratio between toxic and therapeutic effects is referred to as the therapeutic index and may 
be expressed as the ratio: LD50/ED50. Compounds that exhibit large therapeutic indices 
are preferred. While compounds that exhibit toxic side effects may be used. However, in 

20 such instances it is particularly preferable to use delivery systems that specifically target 
such compounds to the site of affected tissue so as to minimize potential damage to other 
cells, tissues or organs and to reduce side effects. 

Data obtained fi-om cell culture assay or animal studies may be used to formulate a 
range of dosages for use in humans. The dosage of compounds used in therapeutic 

25 methods of the present invention preferably he within a range of circulating concentrations 
that includes the ED50 concentration but with little or no toxicity (e.g., below the LD50 
concentration). The particular dosage used in any application may vary within this range, 
depending upon factors such as the particular dosage form employed, the route of 
administration utilized, the conditions of the individual (e.g., patient), and so forth. 

{W:\04305\100H154-US1\00085062.DOC liililinillllDIDIIIDIIIfflEDn } 

45 



Non-human animals include, without hmitation, laboratory animals such as mice, 
rats, rabbits, hamsters, guinea pigs, etc.; domestic animals such as dogs and cats; farm 
animals such as sheep, goats, pigs, horses, and cows; and non-human primates. 

A therapeutically effective dose may be initially estimated from cell culture assays 
5 and formulated in animal models to achieve a circulating concentration range that includes 
the IC50. The IC50 concentration of a compound is the concentration that achieves a half- 
maximal inhibition of symptoms (e.g., as determined from the cell culture assays). 
Appropriate dosages for use in a particular individual, for example in human patients, may 
then be more accurately determined using such information. 
10 Measures of compounds in plasma may be routinely measured in an individual such 

as a patient by techniques such as high performance liquid chromatography (HPLC) or gas 
chromatography. 

Pharmaceutical compositions for use in accordance with the present invention may 
be formulated in conventional manner using one or more physiologically acceptable 

1 5 carriers or excipients. 

Thus, the compounds and their physiologically acceptable salts and solvates may be 
formulated for administration by the routes described above. 

For oral administration, the pharmaceutical compositions may take the form of, for 
example, tablets or capsules prepared by conventional means with pharmaceutically 

20 acceptable excipients such as binding agents (e.g., pregelatinised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., lactose, 
microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., magnesium 
stearate, talc or siUca); disintegrants (e.g., potato starch or sodium starch glycolate); or 
wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated by methods well 

25 known in the art. Liquid preparations for oral administration may take the form of, for 
example, solutions, syrups or suspensions, or they may be presented as a dry product for 
constitution with water or other suitable vehicle before use. Such hquid preparations may 
be prepared by conventional means with pharmaceutically acceptable additives such as 
suspending agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
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emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., methyl or 
propyl-p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer salts, 
flavoring, coloring and sweetening agents as appropriate. 
5 Preparations for oral administration may be suitably formulated to give controlled 

release of the active compound. For buccal administration the compositions may take the 
form of tablets or lozenges formulated in conventional manner. For administration by 
inhalation, the compounds for use according to the present invention are conveniently 
delivered in the form of an aerosol spray presentation from pressurized packs or a 

10 nebuliser, with the use of a suitable propellant, e.g., dichlorodifluoromethane, 

trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In 
the case of a pressurized aerosol the dosage unit may be determined by providing a valve to 
deUver a metered amount. Capsules and cartridges of e.g., gelatin for use in an inhaler or 
insufflator may be formulated containing a powder mix of the compound and a suitable 

15 powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., 
by bolus injection or continuous infusion. Formulations for injection may be presented in 
unit dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. 
The compositions may take such forms as suspensions, solutions or emulsions in oily or 

20 aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing 
and/or dispersing agents. Alternatively, the active ingredient may be in powder form for 
constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as 
suppositories or retention enemas, e.g., containing conventional suppository bases such as 

25 cocoa butter or other glycerides. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular 
injection. Thus, for example, the compounds may be formulated with suitable polymeric or 
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hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange 
resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. 

The compositions may, if desired, be presented in a pack or dispenser device that 
may contain one or more xmit dosage forms containing the active ingredient. The pack may 
5 for example comprise metal or plastic foil, such as a blister pack. The pack or dispenser 
device may be accompanied by instructions for administration. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labelled for treatment of an indicated condition. For 
administration of an inhibitor of a polypeptide GalNAc-transferase, such labelling would 
10 include amount, frequency, and method of administration. 

The use of inhibitors of the lectin domain mediated activities of the above 
mentioned polypeptide GalNAc-transferase isoforms and other isoforms allows for unique 
selective inhibition of these functions in vitro and in vivo in cells and organisms. This is 
desirable in manipulating the density of 0-glycans, e.g. changing high density O- 
15 glycosylated tumour-associated MUCl to low density normal MUCl in cells. Further this 
is desirable in inhibiting any adhesive role the lectin domains may play in Golgi transport 
and intracellular sorting. 

EXAMPLES 

1. Cloning, Expression, and Purification of Soluble GalNAc-Transferase 
20 Proteins and Soluble GalNAc-Transferase Lectins. 

Polypeptide GalNAc-transferases are highly conserved throughout evolution. 
Orthologous relationships can be defined from man to Drosophila, 48 and ortholgous 
members of all human polypetide GalNAc-transferase isoforms are clearly identifiable in 
mouse and rats, and likely all mammals. 
25 Polypeptide GalNAc-transferases are predicted to be type 11 transmembrane Golgi- 

resident proteins with a domain stmcture depicted in Figure 1 2. The N-terminal 
cytoplasmic tail, the hydrophic transmembrane signal sequence, and the stem region may 
be involved in directing Golgi-localization 47. The catalytic unit of the enzymes is 
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approximately 300-350 amino acid residues and highly conserved in primary sequence 
among isoforms and also throughout evolution of the gene family 3,48. The C-terminal 
region of approximately 130 amino acids exhibits similarity with the galactose binding 
lectin, ricin. This region show little sequence similarity among isoforms and is poorly 

5 conserved in evolution 3. 

Soluble, secreted expression constructs of human GalNAc-transferases GalNAc-Tl, 
-T2, -T3, -T4, -T6, -T7, and -Tl 1 for baculo-virus mediated expression in insect cells have 
been described in detail previously ^»^»*^^2-34 jj^g.^^gged soluble expression constructs for 
all human ppGalNAc-transferases, including novel genes designated GalNAc-T12, -T13, - 

10 T14, -T15, and -T16, were prepared using PGR primers as listed in Table I below. 
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TABLE I. Primers used for PGR of s luble secreted GalNAc-transferase expression constructs. 



GalNAc-Tl: 

EBHC121H: 
EBHC107B: 

GalNAc-T2: 

EBHC75D: 
EBHC68: 

GalNAc-T3: 

EBHC219H: 
EBHC215: 

GaINAc-T4: 

EBHC318: 
EBHC307: 

GalNAc-T5: 

EBHC909 : 
EBHC907: 

GaINAc-T6: 

EBHC514H: 
EBHC511N: 

GalNAc-T7: 

EBHC1122H: 
EBHC1116H: 

GalNAc-T8: 

EBHC1820: 
EBHC1821: 

GalNAc-T9: 

EBHC1320 : 
EBHC1321: 

GalNAc-TlO: 

EBHC2520: 
EBHC2521: 

GalNAc-Tll: 

EBHC629: 
EBHC614: 

GalNAc-T12: 

EBHC1051: 
EBHC1032: 

GalNAc-T13: 

EBHC2000: 
EBHC2002 : 

GalNAc-T14: 

EBHC1720: 
EBH21721: 

GalNAc-T15: 

EBHC412: 
EBHC438: 

GaINAG-T16: 

EBHC1913: 
EBHC1912: 



5' -GCGGGATCCAGGACTTCCTGCTGGAGATG-3' 
5' -GCGGATCCTCAGAATATTTCTGGAAGGG-3' 



5' -GCGGAATTCTTAAAAAGAAAGACCTTCATCACAGC-3' 
5' -GCGGAATTCCTACTGCTGCAGGTTGAGC-3' 



5' -GCGGGATCCAACGATGGAAAGGAACATG-3' 

5 ' -AGCGGATCCAGGAACACTTAATCATTTTGGC-3 ' 



5 ' -GCGGGATCCTTTTCATGCCTCCGCAGGAGCC- 3 ' 
5' -GCGGGATCCGACGAAAGTGCTGTTGTGCTC-3' 



5 ' -GCGGGATCCTGCTTTAACTGGAGGGCTAGAGC- 3 ' 
5 ' -GCGGGATCCATCAGTTACACTTCAGGCTTC-3 ' 



5 ' -GCGGGATCCCCTGGACCTCATGCTGGAGGCCATG-3 ' 
5 ' -AGCGGATCCTGGGGATGATCTGGGTCCTAGAC- 3 ' 



5' -GCGAAGCTTCAGGATGAGGGAAGACAGAGATG 

5 ' -GCGAAGCTTCTCTCTAAACACTATGGATCTTATTC-3 ' 



5 ' -GCGGGATCCTCTGAAAGAAAGTATGAAATTAGC-3 ' 
5' -GCGGGATCCTCACTGGCTGTTGGTCTGACC 



5 ' -GCGGGATCCCTGCCGCCTGCAGGGCCGCTCCCAG-3 ' 
5' -GCGGGATCCTCAGTGCCGTCGGTGTTTGATCC-3' 



5 ' -GCGGGATCCCCGCGAGCGGCAGCCCGACGGC-3 ' 
5 ' -GCGGGATCCTCAGTTCCTATTGAATTTTTC-3 ' 



5 ' -GCGAATTCGTGAAGTGACTCAGCCACTTAAG- 3 ' 
5' -GCGAATTCGTCTCTGTCAGACACGTGTC-3' 



5' -GCGGGATCCGGCTCGGTGCTGCGGGCGCAGCG-3 ' 

5' -GCGGGATCCTCATAACATGCGCTCTTTGAAGAACC-3' 



5' -GCGGGATCCGATGTTGCACVVTCCCCACCACACC-3' 
5' -GCGGGATCCTCATCGTTCATCCACAGCATTG-3' 



5 ' -GCGGGATCCTCTGCTGCCTGCATTGAGGGCTG'3 ' 
5' -GCGGGATCCTCATGTGCCCAAGGTCATGTTCC-3' 



5' -GCGGGATCCCAAGAGGAAGTTGGAGGTGCCG-3' 
5 ' -GCGGGATCCCAGGGGTCCTCAAGAGCTCACC-3 ' 



5' -GCGGGATCCCTACTACTTATGGCAGGACAACCG-3' 
5 ' -GCGTCATGTGTGTGGCAACAGCTGCCACTG - 3 ' 
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Expression constructs were amplified by PCR using 20 ng plasmid DNA as 
template. Expand High Fidelity-kit (Roche) was used as recommended by the manufacturer 
using an ABI2700 thermocycler (Applied Biosystems). Products were digested with EcoRl 
(GalNAc-T2, -Til, -T12 and -T16), BarriHl (GalNAc-Tl, -T3, -T4, -T5, -T6, -T8, -T9, - 
5 TIO, -T13, -T14 and -T15) and Hindm (GalNAc-T7), and sub-cloned into the EcoRI or 
Hindm site of pBKS-HistagI or the BamHI site of pBKS-HistagH. PBKS-Histag-I and -H 
vectors were generated from pBluescrip (Stratagene), by inserting a fragment encoding 
6xHis, a thrombin cleavage site, and a T7 antibody site. pBKS-Histag-I was modified with 
the sequence 

10 5'-GCGGCCGCTCTAGAACTAGTGGATCCAGCAGCCATCATCATCATCATCACAGCAGCGGCC 
TGGTGCCGCGCGGCAGCCATATGGCTAGCATGACTGGTGGACAGCAAATG^ 
TATCAAGCTTATCGATACCGTCGACCTCGAG-3 ' . 

pBBCS-Histag-n was modified with the sequence: 

5 ' - GAATTCGCGGCCGCAGCAGCCATCATCATCATCATCACAGCAGCGGCCTGGTGCCGCGCG 
1 5 GCAGCCATATGGCTAGCATGACTGGTGGACAGCAAATGGATCCACTAGTTCTAGAGCGGCCGC -3 ' . 

All construct were fiiUy sequenced. His-tagged GalNAc-transferase pBKS-HIS-tag- 
I constructs were excised with NotI and Xhol (blunt-ended) or as for GalNAc-Tl 1, with 
NotI and Hindlll (blunt-ended) and sub-cloned into the Notl/Bgin (blunt-ended) site of the 
pAcGP67A Baculo expression vector (Pharmingen). His-tagged GalNAc-transferase 
20 pBKS-HIS-tag-II constructs were excised with NotI and inserted into the NotI site of 
pAcGP67A Baculo expression vector. 

The coding region for human polypeptide GalNAc-T12 has been submitted to 
GenBank/EBI Data Bank and assigned accession number AJ 132365: 

Human GalNAc-T12 DNA sequence: 

25 ATGTGGGGGCGCACGGCGCGGCGGCGCTGCCCGCGGGAACTGCGGCGCGGCCGGGAGGCGCTGTTGGTGCTCC 
TGGCGCTACTGGCGTTGGCCGGGCTGGGCTCGGTGCTGCGGGCGCAGCGTGGGGCCGGGGCCGGGGCTC 
GCCGGGACCCCCGCGCACCCCGCGCCCCGGGCGGCGCGAGCCGGTCATGCCGCGGCCGCCGGTGCCGGCGAAC 
GCGCTGGGCGCGCGGGGCGAGGCGGTGCGGCTGCAGCTGCAGGGCGAGGAGCTGCGGCTGCAGGAGGAGAGCG 
TGCGGCTGCACCAGATTAACATCTACCTCAGCGACCGCATCTCACTGCACCGCCGCCTGCCCGAGCGCTGGAA 

30 CCCGCTGTGCAAAGAGAAGAAATATGATTATGATAATTTGCCCAGGACATCTGTTATCATAGCATTTTATAAT 
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GAAGCCTGGTCAACTCTCCTTCGGACAGTTTACAGTGTCCTTGAGACATCCCCGGATATCCTGCTAGAAGAAG 
TGATCCTTGTAGATGACTACAGTGATAGAGAGCACCTGAAGGAGCGCTTGGCCAATGAGCTTTCGGGACTC 
CAAGGTGCGCCTGATCCGCGCCAACAAGAGAGAGGGCCTGGTGCGAGCCCGGCTGCTGGGGGCGTCT^^ 
AGGGGCGATGTTCTGACCTTCCTGGACTGTCACTGTGAATGCCACGAAGGGTGGCTGGAGCCG 
5 GGATCCATGAAGAGGAGTCGGCAGTGGTGTGCCOSGTGATTGATGTGATCGACTGGAACT^CCTTCG 
GGGGAACTCCGGGGAGCCCCAGATCGGCGGTTTOSACTGGAGGCTGGTGTTC^^ 

AGGGAGAGGATACGGATGCAATCCCCCGTCGATGTCATCAGGTCTCCAACAATGGCTGGTGGGCTGTTTGCTC 
TGAGTAAGAAATATTTTGAATATCTGGGGTCTTATGATACAGGAATGGAAGTTTGGGGAGGAGAA^ 
ATTTTCCTTTAGGATCTGGCAGTGTGGTGGGGTTCTGGAAACACACCCATGTTCCCATGTO 
10 CCCAAGCAAGCTCCCTACTCCCGCAACAAGGCTCTGGCCAACAGTGTTCGTGCAGCTGAAGTATGGATGGATG 

AATTTAAAGAGCTCTACTACCATCGCAACCCCCGTGCCCGCTTGGAACCTTTTGGGGATGTGACAGAGAGGAA 
GCAGCTCCGGGACAAGCTCCAGTGTAAAGACTTCAAGTGGTTCTTGGAGACTGTGTATCCAGAACTGCATGTG 
CCTGAGGACAGGCCTGGCTTCTTCGGGATGCTCCAGAACAAAGGACTAACAGACTACTGCTTTGACTATAACC 
CTCCCGATGAAAACCAGATTGTGGGACACCAGGTCATTCTGTACCTCTGTCATGGGATGGGCCAGAATCAGTT 
15 TTTCGAGTACACGTCCCAGAAAGAAATACGCTATAACACCCACCAGCCTGAGGGCTGCATTGCTGTGGAAGCA 
GGAATGGATACCCTTATCATGCATCTCTGCGAAGAAACTGCCCCAGAGAATCAGAAGTTCATCTTGCAGGAGG 
ATGGATCTTTATTTCACGAACAGTCCAAGAAATGTGTCCAGGCTGCGAGGAAGGAGTCGAGTGACAGTT^^ 
TCCACTCTTACGAGACTGCACCAACTCGGATCATCAGAAATGGTTCTTCAAAGAGCGCATGTTATGA 

Human GalNAc-T12 amino acid sequence: 

20 MWGRTARRRCPRELRRGREALLVLLALLALAGLGSVLRAQRGAGAGAAEPGPPRTPRPGRREPVM^ 
RPPVPANALGARGEAWLQLQGEELRLQEESVRLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTS 
VIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSGLPKVRLIRA^ 
LLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEEESAWCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVF 
TWHTVPERERIRMQSPVDVIRSPTMAGGLFAVSKKYFEYIJGSYDTGMEWGGENLEFSFRIWQCGG^^ 

25 SHVGHFSPSKLPTPRNKALANSVRAAEVWMDEFKELYYHRNPRARLEPFGDVT^^ 

VYPELHVPEDRPGFFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIRYNTHQPE 

GCIAVEAGl^TLIMHLCEETAPENQKFILQEDGSLFHEQSKKCVQAARKESSDSFVPLLRDCTNSDHQKW 

ERML 

The coding region for human polypeptide GalNAc-T13 has been submitted to 
30 GenBank/EBI Data Bank and assigned accession number ARl 53422 : 
Himian GalNAc-T13 DNA sequence: 

ATGCTCCTAAGGAAGCGATACAGGCACAGACCATGCAGACTCCAGTTCCTCCTGCTGCTCCTGATG 
CTGGGATGCGTCCTGATGATGGTGGCGATGTTGCACCCTCCCCACa^CACCCTGCACCAG 
AAGCCAGCAAGCACAGCCCTGAAGCCAGGTACCGCCTGGACTTTGGGGAATCCCAGGATTC^ 
35 TGAGGATGAGGGTGAAGAGTACAGCCCTCTGGAGGGCCTGCCACCCTTTATCTCACTGCGGGAGGATC^ 
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CTGGTGGCCGTGGCCTTACCCCAGGCCAGAAGGAACCAGAGCCAGGGCAGGAGAGGTGGGAGCTACCGCCTCA 
TCTW^GCAGCCAAGGAGGCAGGATAAGGAAGCCCCT^GAGGGA 

TGAAGAAGAGGAGTTGACCCCGTTCAGCCTGGACCCACGTGGCCTCCAGGAGGCACTCAGTGCCCGCATCCCC 
CTCCAGAGGGCTCTGCCCGAGGTGCGGCACCCACTGTGTCTGCAGCAGCACCCTCAGGACAGCCTGCCCACAG 
5 CCAGCGTCATCCTCTGTTTCCATGATGAGGCCTGGTCCACTCTCCTGCGGACTGTACACAGCATCCTCGACAC 
AGTGCCCAGGGCCTTCCTGAAGGAGATCATCCTCGTGGACGACCTCAGCCAGCT^GGACAACTCA^ 
CTCAGCGAATATGTGGCCAGGCTGGAGGGGGTGAAGTTACTCAGGAGCAACAAGAGGCTGGGTC 
CCCGGATGCTGGGGGCCACCAGAGCCACCGGGGATGTGCTCGTCTTCATGGATGCCCACTGCGAGTGCCACCC 
AGGCTGGCTGGAGCCCCTCCTCAGCAGAATAGCTGGTGACAGGAGCCGAGTGGTATCTCCGGTGATAGATGTG 

1 0 ATTGACTGGAAGACTTTCCAGTATTACCCCTCAAAGGACCTGC AGCGTGGGGTGTTGGACTGGA^ 

TCCACTGGGAACCTTTGCCAGAGCATGTGAGGAAGGCCCTCCAGTCCCCCATAAGCCCCATCAGGAGCCCTGT 
GGTGCCCGGAGAGGTGGTGGCCATGGACAGACATTACTTCCAAAACACTGGAGCGTATGACTCTCTTATGTCG 
CTGCGAGGTGGTGAAAACCTCGAACTGTCTTTCAAGGCCTGGCTCTGTGGTGGCTCTGTTGAAATCCTTCCCT 
GCTCTCGGGTAGGACACATCTACCAAAATCAGGATTCCCATTCCCCCCTCGACCAGGAGGCCACCCTGAGGAA 

15 CAGGGTTCGCATTGCTGAGACCTGGCTGGGGTCATTCAAAGAAACCTTCTACAAGCATAGCCCAGAGGCCTTC 
TCCTTGAGCAAGGCTGAGAAGCCAGACTGCATGGAACGCTTGCAGCTGCAAAGGAGACTGGGTTGTCGGACAT 
TCCACTGGTTTCTGGCTAATGTCTACCCTGAGCTGTACCCATCTGAACCCAGGCCCAGTTTCTCTGGAAAGCT 
CCACAACACTGGACTTGGGCTCTGTGCAGACTGCCAGGCAGAAGGGGACATCCTGGGCTGTCCCATGGTG^ 
GCTCCTTGCAGTGACAGCCGGCT^GCAACAGTACCTGCAGCACACCAGCAGGAAGGAGATTCTVCTTT^ 

20 CACAGCACCTGTGCTTTGCTGTCAGGCAGGAGCAGGTGATTCTTCAGAACTGCACGGAGGAAGGCCTGGCCA^ 
CCACCAGCAGCACTGGGACTTCCAGGAGAATGGGATGATTGTCCACATTCTTTCTGGGAAATGCATGG 
GTGGTGCAAGAAAACAATAAAGATTTGTACCTGCGTCCGTGTGATGGAAAAGCCCGCCAGCAGTGGCGTTTTG 
ACCAGATCAATGCTGTGGATGAACGATGA 

Human GalNAc-T13 amino acid sequence: 

25 MLLRKRYRHRPCRLQFLLLLLMLGCVLMMVAMLHPPHHTLHQTVTAQASKHSPEARYRLDFGESQDWVLEA 
EDEGEE YS PLEGLPPF I S LREDQLLVAVALPQARRNQS QGRRGGS YRL I KQPRRQDKEAPKRDWGADEDGE 
VSEEEELTPFSLDPRGLQEALSARIPLQRALPEVRHPLCLQQHPQDSLPTASVILCFHDEAWSTLLRTVHS 
ILDTVPRAFLKEIILVDDLSQQGQLKSALSEYVARLEGVKLLRSNKRLGAIRARMLGATRATGDVLVFMDA 
HCECHPGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQYYPSKDLQRGVLDWKLDFHWEPLPEHVRKALQSP 

30 ISPIRSPWPGEWAMDRHYFQNTGAYDSLMSLRGGENLELSFKAWLCGGSVEILPCSRVGHIYQNQDSHS 
PLDQEATLRNRVRIAETWLGSFKETFYKHSPEAFSLSKAEKPDCMERLQLQRRLGCRTFHWFLANVYPELY 
PSEPRPSFSGKLHNTGLGLCADCQAEGDILGCPMVLAPCSDSRQQQYLQHTSRKEIHFGSPQHLCFAVRQE 
QVILQNCTEEGLAIHQQHWDFQENGMIVHILSGKCMEAWQENNKDLYLRPCDGKARQQWRFDQINAVDER 

The coding region for human polypeptide GalNAc-T14 has been submitted to 
35 GenBank/EBI Data Bank and assigned accession number AJ505991 : 
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Human GalNAc-T14 DNA sequence: 

ATGAGGAGATTTGTCTACTGCAAGGTGGTTCTAGCCACTTCGCTGATGTGGGTTCTTGTTGATGTC 
TTCTTACTGCTGTACTTCAGTGAATGTAACAAATGTGATGACAAGAAGGAGAGATCTCTGCTGCCTGCATTGA 
GGGCTGTTATTTCAAG7\AACCAAGAAGGGCCAGGAGAAATGGGAAAAGCTGTGTTGATTCCT 
5 GGAGAAAATGAAAGAGCTGTTTAAAATCTUITCAGTTTAACCTTATGGCCAGTGATTTGAT^ 

AGTCTGCCAGATGTAAGATTAGAAGGATGTAAGACAAAAGTCTACCCTGATGAACTTCCAT^CACAAGTGTAG 
TCATTGTGTTTCATAATGAAGCTTGGAGCACTCTCCTTAGAACTGTTTACAGTGTGATAAAT^^^ 
CTATCTACTCTCAGAGGTCATCTTGGTAGATGATGCCAGTGAAAGAGATTTTCTCAAGTTGACATTAGAGAAT 
TACGTGAAAAATTTAGAAGTGCCAGTAAAAATTATTAGGATGGAAGAACGCTCTGGGTTAATACGTGCCCGTC 

1 0 TTCGAGGAGC7^GCTGCTTCAAAAGGGCAGGTCATAACTTTTCTTGATGCACACTGTGAATGCACGTTA(^ 
GCTGGAGCCTTTGCTGGCAAGAATAAAGGAAGACAGGAAAACGGTTGTCTGCCCTATCATTGATGTGAOT 
GATGATACTTTTGAATATATGGCTGGGTCAGACATGACTTATGGGGGTTTTAACTGGAAACTC 
GGTATCCTGTTCCCCAAAGAGAAATGGACAGGAGGAAAGGAGACAGAACATTACCTGTCAGGACCCCTACT^ 
GGCTGGTGGCCTATTTTCTATTGACAGAAACTACTTTGAAGAGATAGGAACTTACGATGCAGGAATGGA 

15 TGGGGTGGAGAGAATCTTGAAATGTCTTTTAGGATTTGGCAATGTGGAGGCTCCTTGGAGATTGTTACTTGCT 
CCCATGTTGGTCATGTTTTTCGGAAGGCAACTCCATACACTTTTCCTGGTGGCACTGGTCATGTCATCAACAA 
GAACAACAGGAGACTGGCAGAAGTTTGGATGGATGAATTTAAAGATTTCTTCTACATCATATCCCCAGGTGTT 
GTCAAAGTGGATTATGGAGATGTGTCAGTCAGAAAAACACTAAGAGAAAATCTGAAGTGTAAGCCCTTTTCTT 
GGTACCTAGAAAACATCTATCCGGACTCCCAGATCCCAAGACGTTATTACTCACTTGGTGAGATAAGAAATGT 

20 TGAAACCAATCAGTGTTTAGACAACATGGGCCGCAAGGAAAATGAAAAAGTGGGTATATTCAACTGTCATGGT 
ATGGGAGGAAATCAGGTATTTTCTTACACTGCTGACAAAGAAATCCGAACCGATGACTTGTGCTTGGATG^^ 
CTAGACTCAATGGACCTGTAATCATGTTAAAATGCCACCATATGAGAGGAAATCAGTTATGGGAATATGATGC 
TGAGAGACTCACGTTGCGACATGTT/^CAGTAACCAATGTCTCGATGAACCTTCTGAAGAAGAC^^ 
CCTACAATGCAGGACTGTAGTGGAAGCAGATCCC7UVCAGTGGCTGCTAAGGAACATGACCTTGGGCACA 

25 Human GalNAc-T14 amino acid sequence: 

MRRFVYCK\A^TSLMWVLVDVFLLLYFSECNKCDDKKERSLLPALRAVISRNQEGPGEMGKAVLI 
PKDDQEKMKELFKINQFNLMASDLIAIiNRSLPDVRLEGCKTK\nrPDELPNTSWIVFH^ 
NRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQV^ 
CTLGWLEPLLARIKEDRKTWCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMD^^ 
30 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTG 
HVINKNNRRLAEVWMDEFKDFFYIISPGVVKVDYGDVSWKTLRENLKCK^ 
EIRl^TNQCLDimGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRI^ 
WEYDAERLTLRHVNSNQCLDEPSEEDKMVPTMQDCSGSRSQQWLLRNMTLGT 

The coding region for human polypeptide GalNAc-T15 has been submitted to 
35 GenBank/EBI Data Bank and assigned accession number Y09324: 
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Human GalNAc-T15 DNA sequence: 

ATGCGGCGCCTGACTCGTCGGCTGGTTCTGCCAGTCTTCGGGGTGCTCTGGATCACGGTGCTGCTG 
TTCTTCTGGGTAACCAAGAGGAAGTTGGAGGTGCCGACGGGACCTGAAGTGCAGACCCCTAAGCCTTCGGACG 
CTGACTGGGACGACCTGTGGGACCAGTTTGATGAGCGGCGGTATCTGAATGCCAAAAAGTGGCGCGTTGGTGA 
5 CGACCCCTATAAGCTGTATGCTTTCAACCAGCGGGAGAGTGAGCGGATCTCCAGCAATCGGGCCATCCCGGAC 
ACTCGCCATCTGAGATGCACACTGCTGGTGTATTGCAaSGACCTTCCA^ 

ACAACGAAGCCCGCTCCACGCTGCTCAGGACCATCCGCAGTGTATTAAACCGCACCCCTACGCATCrGATCCG 

GGAAATCATATTAGTGGATGACTTCAGCAATGACCCTGATGACTGTAAACAGCTCATCAAATTGCCC^ 

AAATGCTTGCGCAATAATGAACGGCAAGGTCTGGTCCGGTCCCGGATTCGGGGCGCTGACATCGCCCAGGGCA 

10 CCACTCTGACTTTCCTCGACAGCCACTGTGAGGTGAACAGGGACTGGCTCCTVGCCTCTGTTGCACA^ 

AGAAGACTACACGCGGGTGGTGTGCCCTGTGATCGATATCATTAACCTGGACACCTTCACCTACATCGAGTCT 
GCCTCGGAGCTCAGAGGGGGGTTTGACTGGAGCCTCCACTTCC7VGTGGGAGCAGCTCTCCCCAGAGCAGAA 
TCGGCGCCTGGACCCCACGGAAGCCCATCAGGACTCCTATCATAGCTGGAGGGCTCTTCGTGATCGACAAAGC 
TTGGTTTGATTACCTGGGGAAATATGATATGGACATGGACATCTGGGGTGGGGAGAACTTTGAAATC 

15 CGAGTGTGGATGTGCGGGGGCAGCCTAGAGATCGTCCCCTGCAGCCGAGTGGGGCACGTCTTCCGGAAGAAGC 
ACCCCTACGTTTTCCCTGATGGAAATGCCTU^CTi^CGTATATAAAGAACACCAAGCGGACAGCTG^ 
GGATGAATACAAGCAATACTATTACGCTGCCCGGCCATTCGCCCTGGAGAGGCCCTTCGGGAATGTTGAGAGC 
AGATTGGACCTGAGGAAGAATCTGCGCTGCCAGAGCTTCAAGTGGTACCTGGAGAATATCTACCCTGAACTCA 
GCATCCCCAAGGAGTCCTCCATCCAGAAGGGCAATATCCGACAGAGACAGAAGTGCCTGGAATCTCAAAGGCA 

20 GAACAACCAAGAAACCCCAAACCTAAAGTTGAGCCCCTGTGCCAAGGTCAAAGGCGAAGATGCAAAGTCCCAG 
GTATGGGCCTTCACATACACCCAGCAGATCCTCCAGGAGGAGCTGTGCCTGTCAGTCATCACCTTGTTCCCTG 
GCGCCCCAGTGGTTCTTGTCCTTTGCAAGAATGGAGATGACCGACAGCAATGGACCAAAACTGGTTCCCACAT 
CGAGCACATAGCATCCCACCTCTGCCTCGATACAGATATGTTCGGTGATGGCACCGAGAACGGCAAGGAAATC 
GTCGTCAACCCATGTGAGTCCTCACTCATGAGCCAGCACTGGGACATGGTGAGCTCTTGAGGACCCCTGCCAG 

25 AAGCAGCAAGGGCCATGGGGTGGTGCTTCCCTGGACCAGAACAGACTGGAAACTGGGCAGCAAGCAGCCT^ 
ACCACCTCAGACATCCTGGACTGGGAGGTGGAGGCAGAGCCCCCCAGGACAGGAGCAACTGTCTaVGG^ 
CAGAGGAAAACTVTCACAAGCCT^TGGGGCTCAAAGACAAATCCCACATGTTCTCAAGGCCGTTAAGTTC^ 
CCTGGCCAGTCATTCCCTGA 

Human GalNAc-T15 amino acid sequence: 

30 l^RLTRRLVLPVFGVLWITVLLFFWVTKRKLEVPTGPEVQTPKPSDADWDDLWDQFDERRYLNAKK 
WRVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLWCTDLPPTSIIITFHNEARSTLLRTIRSVL^TP 
THLIREIILVDDFSNDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPL 
LHRVKEDYTRWCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKLGAWTPRKPIRTPIIAGGLF 
VIDKAWFDYLGKYDlffiMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKN 

35 AEVWIDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIP^ 
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ESQRQNNQETPNLKLSPCAKVKGEDAKSQWAFTYTQQILQEELCLSVITLFPGAPVVLVLCKNGDDRQQWTO 
TGSHIEHIASHLCLDTDMFGDGTENGKEIWNPCESSLMSQHWDMVSS 

The coding region for human polypeptide GalNAc-T16 has been submitted to 
GenBank/EBI Data Bank and assigned accession number AJ505951 : 
5 Human GalNAc-T 1 6 DNA sequence: 

ATGAGGAAGATCCGCGCCAATGCCATCGCCATCCTGACCGTAGCCTGGATCCTGGGCACTTTCTAC 
TACTTATGGCAGGACAACCGAGCCCACGCAGCATCCTCCGGCGGCCGGGGCGCGCAGAGGGCAGGCAGGAGGT 
CGGAGCAGCTCCGCGAGGACCGCACCATCCCGCTCATTGTGACAGGAACTCCCTCGAAAGGCTTTGATGAGAA 
GGCCTACCTGTCGGCCAAGCAGCTGAAGGCTGGAGAGGACCCCTACAGACAGCACGCCTTCAACCAGCTGGAG 

10 AGTGACAAGCTGAGCCCAGACCGGCCCATCCGGGACACCCGCCATTACAGCTGCCCATCTGTGTCCTACTCCT 
CGGACCTGCCAGCCACa\GCGTCATCATCACCTTCCACAATGAGGCCCGTTCCACCCTGCTGCGCACAGTC 
GAGTGTCCTGAACCGAACTCCTGCCAACTTGATCCAGGAGATCATTTTAGTGGATGACTTCAGCTCAGATCCG 
GAAGACTGTCTACTCCTGACCAGGATCCCCAAGGTCAAGTGCCTGCGCAATGATCGGCGGGAAGGGCTGATCC 
GGTCCCGAGTGCGTGGGGCGGACGTGGCTGCAGCTACCGTTCTCACCTTTCTGGATAGCCACTGCGAAGTGAA 

15 CACCGAGTGGCTGCCGCCCATGCTGCAGCGGGTGAAGGAGGACCACACCCGCGTGGTGAGTCCCATCATTGAT 
GTCATCAGTCTGGATAATTTTGCCTACCTTGCAGCATCTGCTGACCTTCGTGGAGGGTTCGACTGGAGCCTGC 
ATTTCT^GTGGGAGCAGATCCCTCTTGAGCAGAAGATGACCCGGACAGACCCCACCAGGCCCATAAGGAC^^ 
TGTCATAGCTGGAGGAATCTTCGTGATCGACAAGTCCTGGTTTAACCACTTGGGAAAGTATGATGCCCAGATG 
GACATCTGGGGGGGAGAGAATTTTGAGCTCTCCTTCAGGGTGTGGATGTGTGGTGGCAGTCTGGAGATCGTCC 

20 CCTGCAGCCGGGTGGGCCATGTCTTCAGGAAACGGCACCCCTACAACTTCCCTGAGGGTAATGCCCTCACCTA 
CATCAGGAATACTAAGCGCACTGCAGAAGTGTGGATGGATGAATACAAGCAATACTACTATGAGGCCCGGCCC 
TCGGCCATCGGGAAGGCCTTCGGCAGTGTGGCTACGCGGATAGAGCAGAGGAAGAAGATGAACTGCAAGTCCT 
TCCGCTGGTACCTGGAGAACGTCTACCCAGAGCTCACGGTCCCCGTGAAGGAAGCACTCCCCGGCATCATTAA 
GCAGGGGGTGAACTGCTTAGAATCTCAGGGCCAGAACACAGCTGGTGACTTCCTGCTTGGAATGGGGATCTGC 

25 AGAGGGTCTGCCAAGAACCCGCAGCCCGCCCAGGCATGGCTGTTCAGTGACCACCTCATCCAGCAGCAGGGGA 
AGTGCCTGGCTGCCACCTCCACCTTAATGTCCTCCCCTGGATCCCCAGTCATACTGCAGATGTGCAACCCTAG 
AGAAGGCAAGCAGAAATGGAGGAGAAAAGGATCTTTCATCCAGCATTCAGTCAGTGGCCTCTGCCTGGAGACA 
AAGCCTGCCCAGCTGGTGACCAGCAAGTGTCAGGCTGACGCCCAGGCCCAGCAGTGGCAGCTGTTGCCACACA 
CATGA 

30 Human GalNAc-Tl 6 amino acid sequence: 

MRKIRANAIAILTVAWILGTFYYLWQDNRAHAASSGGRGAQRAGRRSEQLREDRTIPLIVTGTPSK 
GFDEKAYLSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIITFHNEARSTL 
LRTVKSVLNRTPANLIQEIILVDDFSSDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAA^ 
HCEVNTEWLPPMLQRVKEDHTRWSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKI^RTDPT^ 
35 PIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFPEG 
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NALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKl^CKSFRWYLEr^ 

PGIIKQGVNCLESQGQNTAGDFLLGMGICRGSAKNPQPAQAWLFSDHLIQQQGKCLAATSTLMSSPGSPVILQ 

MCNPREGKQKWRRKGSFIQHSVSGLCLETKPAQLVTSKCQADAQAQQWQLLPHT 

Additional homologous polypeptide GalNAc-transferase genes have been identified 
and cloning and expression are in progress, and it follows from the descriptions that similar 
methods as outlined above will yield soluble secreted proteins for study. Expression 
constructs may have immunoaffinity tags or purification tags at the AT-terminal and/or C- 
terminal region. These may include myc, FLAG, HIS, GST, and other (Stratagene, Qiagen, 
Amersham Biosciences). 

Soluble secreted expression constructs of GalNAc-transferase lectin domains were 
prepared from the GalNAc-transferase expression constructs described above by PGR 
using primer pairs as Usted Table n below. 
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Table II. Primers used for amplification of GalNAc-transferase lectin domains 



GalNAc-Tl lectin domain: 

T 1 LEG FOR : 5 ' - CAAAGGAAGCTTATGGAGATATATCGTCAAGAG - 3 ' 

Tl LECRE V : 5 ' - GCAAGCTCGAGGCGGCCGCTCAGAATATTTCTGGAAGGGTGAC - 3 ' 

GalNAc-T2 lectin domain: 

T2 LECFOR : 5 ' - CAAGGAAGCTTCTTATGGAAATATTCAGAGCAGATTG - 3 ' 
T2 LECREV : 5 ' -GCAAGCTCGAGGCGGCCGCCTACTGCTGCAGGTTGAGC - 3 ' 

GalNAc-T3 lectin domain: 

T3 LECFOR : 5 ' - CAAGGAAGCTTCATTTGGTGATCTTTCAAAAAGATTT - 3 ' 
T3 LECREV : 5 ' - GCAAGCTCGAGGCGGCCGCAGGAACACTTAATCATTTTGG - 3 ' 

GalNAc-T4 lectin domain: 

T4 LECFOR : 5 ' - AGAAAAGAAGCTTATGGTGATATTTCTG - 3 ' 
EBHC307: 5 ' -AGCGGATCCGACGAAGTGCTGTTGTGCT -3' 

GalNAc-T5 lectin domain: 

T5LECF0R: 5 ' -CAAGGAAGCTTTAGATGTTGGCAACCTCACCCAGC-3 ' 
T5LECREV : 5 ' - GCAAGCTCGAGGCGGCCGCAAGCATCAGTTACACTTCAGGCTTC - 3 ' 

GalNAc-T6 lectin domain: 

T6LECF0R : 5 ' -CAAGGAAGCTTCCTTCGGTGACATTTCGGAACG-3 ' 
T6LECREV : 5 ' -GCAAGCTCGAGGCGGCCGCTGGGTCCTA6ACAAAGAGCC-3 ' 

GalNAc-T7 lectin domain: 

T7LECF0R : 5 ' -AGAAAAGAAGCTTATGGGGATATATCGGAGCTG-3 ' 

T7LECREV : 5 ' -GCAAGCTCGAGGCGGCCGCTCTCTAAACACTATGGATGTTATTC-3 ' 

GalNAc-T8 lectin domain: 

T8 LECFOR : 5 ' - CAAGGAAGCTTTTGGAGACGTTTCTTCCAGAATG - 3 ' 

T8 LECREV : 5 ' - GCAAGCTCGAGGCGGCCGCTCACTGGCTGTTGGTCTGACCCC - 3 ' 

GalNAc-T9 lectin domain: 

T9 LECFOR ; 5 ' - CAAGGAAGCTTTCGGGGACGTGTCTGAGAGGCTG - 3 ' 

T 9 LECREV : 5 ' - GCAAGCTCGAGGCGGCCGCTCAGTGCCGTGCGTGTTTGATCC - 3 ' 

GalNAc-TlO lectin domain: 

TIOLECFOR : 5 ' -CAAGGAAGCTTCCGCTGGGGATGTCGCAGTCCAG-3 ' 

T 1 0 LECREV : 5 ' - GCAAGCTCGAGGCGGCCGCTCAGTTCCTATTGAATTTTTCC - 3 ' 

GalNAc-Tl 1 lectin domain: 

Tl 1 LECFOR : 5 ' - CAAGGAAGCTTGCAATATCAGTGAGCGTGTGG - 3 ' 
TllLECREV : 5 ' -GCAAGCTCGAGGCGGCCGCCCACCTTAACCTTCCAAATGC- 3 ' 

GalNAc-T12 lectin domain: 

T12LECF0R : 5 ' -CAAGGAAGCTTGGGATGTGACAGAGAGGAAG-3 ' 

Tl 2 LECREV : 5 ' - GCAAGCTCGAGGCGGCCGCTCATAACATGCGCTCTTTGAAGAACC - 3 ' 

GalNAc-T13 lectin domain: 

T13LECF0R : 5 ' -CAAGGAAGCTTCTGAGAAGCCAGACTGCATGG-3 ' 
T13LECREV: 5 , -GCAAGCTCGAGGCGGCCGCTCATCGTTCATCCACAGCATTG-3 ' 

GalNAc-T14 lectin domain: 

Tl 4 LECFOR : 5 ' - CAAGGAAGCTTATGGAGATGTGTCAGTCAGAAAAAC - 3 ' 

Tl 4 LECREV : 5 ' -6CAAGCTCGAGGCGGCCGCTCATGTGCCCAAGGTCATGTTCC - 3 ' 

GalNAc-T15 lectin domain: 

T15LECF0R: 5' 'CAAGGAAGCTTTCGGGAATGTTGAGAGCAGATTG~3 ' 
. T15LECREV: 5' -GCAAGCTCGAGGCGGCCGCTCAAGAACTCACCATGTCCCAGTG-3 ' 

GalNAc-T16 lectin domain: 

Tl 6 LECFOR : 5 ' - CAAGGAAGCTTGCAGTGTGGCTACGCGGATAGAGCAGAGG - 3 ' 
Tl 6 LECREV : 5 ' - GCAAGCTCGAGGCGGCCGCTCATGTGTGTGGCAACAGCTGCC - 3 ' 

PCR amplifications were perfomied with 10 ng GalNAc-transferase plasmid DNA 
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as template and High Fidelity PGR kit (Roche) with conditions recommended by the 
manufacturer. Amplified products were digested with HindUl mdXhol and inserted into 
the Hindin/Xhol site of pBKS-Histagl. All constructs were fiiUy sequenced. Tagged lectin 
domain constructs were excised with NotI and sub-cloned into the Notl site of pAcGP67-A 
5 Baculo expression vector. 

The exact borders of the lectin domains and the catalytic units have not been 
defined, but multiple sequence alignment analysis (Fig. 2) was used to predict the most 
likely borders and these were used for design of PGR primers as listed in Table 11. DNA 
and amino acid sequences of preferred constructs of GalNAc-transferase lectin domains 
10 and their construct design include the following (Table HI): 

Table III: DNA and amino acid sequences of GalNAc-transferase lectin domains. 
GalNAc-Tl lectin domain; 

15 

The lectin domain polypeptide sequence comprises amino acid residues 393-559 of 
GALNTl (GALNTl nucleotide sequence accession number is AJ505952) 
Tl LECTIN DNA sequence 

AAAGAAGCTTATGGAGATATATCGTCAAGAGTTGGTCTAAGACACAAACTACAATGCAAACCTT 
20 TTTCCTGGTACCTAGAGAATATATATCCTGATTCTCAAATTCCACGTCACTATTTCTCATTGGG 
AGAGATACGAAATGTGGAAACGAATCAGTGTCTAGATAACATGGCTAGAAAAGAGAATGAAAAA 
GTTGGAATTTTTAATTGCCATGGTATGGGGGGTAATCAGGTTTTCTCTTATACTGCCAACAAAG 
AAATTAGAACAGATGACCTTTGCTTGGATGTTTCCAAACTTAATGGCCCAGTTACAATGCTCAA 
ATGCCACCACCTAAAAGGCAACCAACTCTGGGAGTATGACCCAGTGAAATTAACCCTGCAGCAT 
25 GTGAACAGTAATCAGTGCCTGGATAAAGCCACAGAAGAGGATAGCCAGGTGCCCAGCATTAGAG 
ACTGCAATGGAAGTCGGTCCCAGCAGTGGCTTCTTCGAAACGTCACCCTTCCAGAAATATTCTG 
A- stop 

Tl LECTIN Amino acid sequence 

30 YGDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGI 
FNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNS 
NQCLDKATEEDSQVPSIRDCNGSRSQQWLLRNVTLPEIF* 
GalNAc-T2 lectin domain: 

35 The lectin domain polypeptide sequence comprises amino acid residues 408-571 of 

GALNT2 (GALNT2 nucleotide sequence accession number is X85019). 
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T2 LECTIN DNA sequence 

TATCCAGAGTTAAGGGTTCCAGACCATCAGGATATAGCTTTTGGGGCCTTGCAGCAGGGAACTA 
ACTGCCTCGACACTTTGGGACACTTTGCTGATGGTGTGGTTGGAGTTTATGAATGTCACAATGC 
TGGGGGAAACCAGGAATGGGCCTTGACGAAGGAGAAGTCGGTGAAGCACATGGATTTGTGCCTT 
5 ACTGTGGTGGACCGGGCACCGGGCTCTCTTATAAAGCTGCAGGGCTGCCGAGAAAATGACAGCA 
GACAGAAATGGGAACAGATCGAGGGCAACTCCAAGCTGAGGCACGTGGGCAGCAACCTGTGCCT 
GGACAGTCGCACGGCCAAGAGCGGGGGCCTAAGCGTGGAGGTGTGTGGCCCGGCCCTTTCGCAG 
CAGTGGAAGTTCACGCTCAACCTGCAGCA GTAG - s top 

10 T2 LECTIN Amino acid sequence 

YPELRVPDHQDIAFGALQQGTNCLDTLGHFADGWGVYECHNAGGNQEWALTKEKSVKHMDLCL 
TWDRAPGSLIKLQGCRENDSRQKWEQIEGNSKLRHVGSNLCLDSRTAKSGGLSVEVCGPALSQ 
QWKFTLNLQQ* 

15 

GalNAc-T3 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 467-633 of 
GALNT3 (GALNT3 nucleotide sequence accession number is AJ505954). 

20 T3 LECTIN DNA sequence 

TCATTTGGTGATCTTTCAAAAAGATTTGAAATAAAACACCGTCTTCGGTGTAAAAATTTTACAT 
GGTATCTGAACAACATTTATCCAGAGGTGTATGTGCCAGACCTTAATCCTGTTATATCTGGATA 
CATTAAAAGCGTTGGTCAGCCTCTATGTCTGGATGTTGGAGAAAACAATCAAGGAGGCAAACCA 
TTAATTATGTATACATGTCATGGACTTGGGGGAAACCAGTACTTTGAATACTCTGCTCAACATG 

25 AAATTCGGCACAACATCCAGAAGGAATTATGTCTTCATGCTGCTCAAGGTCTCGTTCAGCTGAA 
GGCATGTACCTACAAAGGTCACAAGACAGTTGTCACTGGAGAGCAGATATGGGAGATCCAGAAG 
GATCAACTTCTATACAATCCATTCTTAAAAATGTGCCTTTCAGCAAATGGAGAGCATCCAAGTT 
TAGTGTCATGCAACCCATCAGATCCACTCCAAAAATGGATACTTAGCCAAAATGAT TAA" s top 

30 T3 LECTIN Amino acid sequence 

FGDLSKRFEIKHRLRCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLCLDVGENNQGGKPL 
IMYTCHGLGGNQYFEYSAQHEIRHNIQKELCLHAAQGLVQLKACTYKGHKTWTGEQIWEIQKD 
QLLYNPFLKMCLSANGEHPSLVSCNPSDPLQKWILSQND* 

35 

GalNAc-T4 lectin domain; 

The lectin domain polypeptide sequence comprises amino acid residues 405-578 of 
GALNT4 (GALNT4 nucleotide sequence accession number is Y08564). 

40 T4 LECTIN DNA sequence 

GAGGATAGACCAGGCTGGCATGGGGCTATTCGCAGTAGAGGGATCTCGTCTGAATGTTTAGATT 
ATAATTCTCCTGACAACAACCCCACAGGTGCTAACCTTTCACTGTTTGGATGCCATGGTCAAGG 
AGGCAATCAATTCTTTGAATATACTTCAAACAAAGAAATAAGGTTTAATTCTGTGACAGAGTTA 
TGTGCAGAGGTACCTGAGCAAAAAAATTATGTGGGAATGCAAAATTGTCCCAAAGATGGGTTCC 
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CTGTACCAGCAAACATTATTTGGCATTTTAAAGAAGATGGAACTATTTTTCACCCACACTCAGG 
ACTGTGTCTTAGTGCTTATCGGACACCGGAGGGCCGACCTGATGTACAAATGAGAACTTGTGAT 
GCTCTAGATAAAAATCAAATTTGGAGTTTTGAGAA ATAG - s top 

T4 LECTIN Amino acid sequence 

AYGDISERKLLRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLDYNSPDNNPT 
GANLSLFGCHGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWH 
FKEDGTIFHPHSGLCLSAYRTPEGRPDVQMRTCDALDKNQIWSFEK* 



GalNAc-T5 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 486-653 of 
GALTN5 (GALTN5 nucleotide sequence accession nimiber is AJ505956). 

T5 LECTIN DNA sequence 

TTAGATGTTGGCAACCTCACCCAGCAAAGGGAGCTGCGAAAGAAACTGAAGTGCAAAAGTTTCA 
AATGGTACTTGGAGAATGTCTTTCCTGACTTAAGGGCTCCCATTGTGAGAGCTAGTGGTGTGCT 
TATTAATGTGGCTTTGGGTAAATGCATTTCCATTGAAAACACTACAGTCATTCTGGAAGACTGC 
GATGGGAGCAAAGAGCTTCAACAATTTAATTACACCTGGTTAAGACTTATTAAATGTGGAGAAT 
GGTGTATAGCCCCCATCCCTGATAAAGGAGCCGTAAGGCTGCACCCTTGTGATAACAGAAACAA 
AGGGCTAAAATGGCTGCATAAATCAACATCAGTCTTTCATCCAGAACTGGTGAATCACATTGTT 
TTTGAAAACAATCAGCAATTATTATGCTTGGAAGGAAATTTTTCTCAAAAGATCCTGAAAGTAG 
CTGCCTGTGACCCAGTGAAGCCATATCAAAAGTGGAAATTTGAAAAATATTATGAAGCCTGA- 



T5 LECTIN Amino acid sequence 

DVGNLTQQRELRKKLKCKS FKWYLENVFPDLRAP I VRASGVL INVALGKC I S I ENTTVILEDCD 
GSKELQQFNYTWLRLI KCGEWCIAPI PDKGAVRLHPCDNRNKGLKWLHKSTSVFHPELVNHI VF 
ENNQQLLCLEGNFSQKILKVAACDPVKPYQKWKFEKYYEA* 



GalNAc-T6 lectin domain: 

The lectin polypeptide sequence comprises amino acid residues 458-622 of 
GALNT6 (GALNT6 nucleotide sequence accession number is AJ 133523) 

T6 LECTIN DNA sequence 

TCCTTCGGTGACATTTCGGAACGACTGCAGCTGAGGGAACAACTGCACTGTCACAACTTTTCCT 
GGTACCTGCACAATGTCTACCCAGAGATGTTTGTTCCTGACCTGACGCCCACCTTCTATGGTGC 
CATCAAGAACCTCGGCACCAACCAATGCCTGGATGTGGGTGAGAACAACCGCGGGGGGAAGCCC 
CTCATCATGTACTCCTGCCACGGCCTTGGCGGCAACCAGTACTTTGAGTACACAACTCAGAGGG 
ACCTTCGCCACAACATCGCAAAGCAGCTGTGTCTACATGTCAGCAAGGGTGCTCTGGGCCTTGG 
GAGCTGTCACTTCACTGGCAAGAATAGCCAGGTCCCCAAGGACGAGGAATGGGAATTGGCCCAG 
GATCAGCTCATCAGGAACTCAGGATCTGGTACCTGCCTGACATCCCAGGACAAAAAGCCAGCCA 
TGGCCCCCTGCAATCCCAGTGACCCCCATCAGTTGTGGCTCTTTGTC TAG-stop 

T6 LECTIN Amino acid sequence 
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SFGDISERLQLREQLHCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQCLDVGENNRGGKP 
LIMYSCHGLGGNQYFEYTTQRDLRHNIAKQLCLHVSKGALGLGSCHFTGKNSQVPKDEEWELAQ 
DQLIRNSGSGTCLTSQDKKPAMAPCNPSDPHQLWLFV* 

5 

GalNAc-T7 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 492-657 of 
GALNT7 (GALNT7 nucleotide sequence accession number is AJ505958). 

10 T7 LECTIN DNA sequence 

TATGGGGATATATCGGAGCTGAAAAAATTTCGAGAAGATCACAACTGCCAAAGTTTTAAGTGGT 
TCATGGAAGAAATAGCTTATGATATCACCTCACACTACCCTTTGCCACCCAAAAATGTTGACTG 
GGGAGAAATCAGAGGCTTCGAAACTGCTTACTGCATTGATAGCATGGGAAAAACAAATGGAGGC 
TTTGTTGAACTAGGACCCTGCCACAGGATGGGAGGGAATCAGCTTTTCAGAATCAATGAAGCAA 

1 5 ATCAACTCATGCAGTATGACCAGTGTTTGACAAAGGGAGCTGATGGATCAAAAGTTATGATTAC 
ACACTGTAATCTAAATGAATTTAAGGAATGGCAGTACTTCAAGAACCTGCACAGATTTACTCAT 
ATTCCTTCAGGAAAGTGTTTAGATCGCTCAGAGGTCCTGCATCAAGTATTCATCTCCAATTGTG 
ACTCCAGTAAAACGACTCAAAAATGGGAAATGAATAACATCCATAGTGTT TAG - s top 

20 T7 LECTIN Amino acid sequence 

YGDISELKKFREDHNCQSFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMG 
KTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKE 
WQWKNLHRFTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNIHSV* 

25 

GalNAc-T8 lectin domain; 

The lectin domain polypeptide sequence comprises amino acid residues 459-637 of 
GALNT8 (GALNT8 nucleotide sequence accession number is AJ505959). 

30 T8 LECTIN DNA sequence 

GACGTTTCTTCCAGAATGGCACTCCGGGAAAAACTGAAATGTAAAACTTTTGACTG 
GTACCTGAAAAATGTTTATCCACTCTTGAAGCCACTCCACACCATCGTGGGCTATGG 
AAGAATGAAAAACCTATTGGATGAAAATGTCTGCTTGGATCAGGGACCCGTTCCAG 
GCAACACCCCCATCATGTATTACTGCCATGAATTCAGCTCACAGAATGTCTACTATC 

35 ACCTAACTGGGGAGCTCTATGTGGGACAACTGATTGCAGAGGCCAGTGCTAGTGAT 
CGCTGCCTGACAGACCCTGGCAAGGCGGAGAAGCCCACCTTAGAACCATGCTCCAA 
GGCAGCTAAGAATAGACTGCATATATATTGGGATTTTAAACCGGGAGGAGCTGTCA 
TAAACAGAGATACCAAGCGGTGTCTGGAGATGAAGAAGGATCTTTTGGGTAGCCAC 
GTGCTTGTGCTCCAGACCTGTAGCACGCAAGTGTGGGAAATCCAGCACACTGTCAG 

40 AGACTGGGGTCAGACCAACAGCCAGTGA// 

T8 LECTIN Amino acid sequence 

FGDVSSRMALREKLKCKTFDWYLKhTSTiTLLKPLHTIVGYGRMKNLLDENVCLDQGPV 
PGmriMYYCHEFSSQNVYYHLTGELYVGQLIAEASASDRCLTDPGKAEKPTLEPCSKA 
45 AKNRLHIYWDFKPGGAVINRDTKRCLEMKKDLLGSHVLVLQTCSTQVWEIQHTV^ 
GQTNSQ// 
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GalNAc-T9 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 427-603 of 
GALNT9 (GALNT9 nucleotide sequence accession number is AJ505960). 

5 T9 LECTIN DNA sequence 

TTCGGGGACGTGTCTGAGAGGCTGGCCCTGCGTCAGAGGCTGAAGTGTCGCAGCTTCAAGTGGT 
ACCTGGAGAACGTGTACCCGGAGATGAGGGTCTACAACAACACCCTCACGTACGGAGAGGTGAG 
AAACAGCAAAGCCAGTGCCTACTGTCTGGACCAGGGAGCGGAGGACGGCGACCGGGCGATCCTC 
TACCCCTGCCACGGGATGTCCTCCCAGCTGGTGCGGTACAGCGCTGACGGCCTGCTGCAGCTGG 
10 GGCCTCTGGGCTCCACAGCCTTCTTGCCTGACTCCAAGTGTCTGGTGGATGACGGCACGGGCCG 
CATGCCCACCCTGAAGAGGTGTGAGGATGTGGCGCGGCCAACACAGCGGCTGTGGGACTTCACC 
CAGAGTGGCCCCATTGTGAGCCGGGCCACGGGCCGCTGCCTGGAGGTGGAGATGTCCAAAGATG 
CCAACTTTGGGCTCCGGCTGGTGGTACAGAGGTGCTCGGGGCAGAAGTGGATGATCAGAAACTG 
GATCAAACACGCACGGCAC TGA- s top 

15 

T9 LECTIN Amino acid sequence 

FGDVSERLALRQRLKCRSFKWYLENVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAE 
DGDRAILYPCHGMSSQLVRYSADGLLQLGPLGSTAFLPDSKCLVDDGTGRMPTLKRCE 
DVARPTQRLWDFTQSGPIVSRATGRCLEVEMSKDANFGLRLWQRCSGQKWMIRNWI 
20 KHARH* 

GalNAc-TlO lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 417-603 of 
25 GALNTIO (GALNTIO nucleotide sequence accession number is AJ505950). 

TIG LECTIN DNA sequence 

GCTGGGGATGTCGCAGTCCAGAAAAAGCTCCGCAGCTCCCTTAACTGCAAGAGTTTCAAGTGGTTTATGAC 
GAAGATAGCCTGGGACCTGCCCAAATTCTACCCACCCGTGGAGCCCCCGGCTGCAGCTTGGGGGGAGATCC 

30 GAAATGTGGGCACAGGGCTGTGTGCAGACACAAAGCACGGGGCCTTGGGCTCCCCACTAAGGCTAGAGGGC 
TGCGTCCGAGGCCGTGGGGAGGCTGCCTGGAACAACATGCAGGTATTCACCTTCACCTGGAGAGAGGACAT 
CCGGCCTGGAGACCCCCAGCACACCAAGAAGTTCTGCTTTGATGCCATTTCCCACACCAGCCCTGTCACGC 
TGTACGACTGCCACAGCATGAAGGGCAACCAGCTGTGGAAATACCGCAAAGACAAGACCCTGTACCACCCT 
GTCAGTGGCAGCTGCATGGACTGCAGTGAAAGTGACCATAGGATCTTCATGAACACCTGCAACCCATCCTC 

35 TCTCACCCAGCAGTGGCTGTTTGAACACACCAACTCAACAGTCTTGGAAAAATTCAATAGGAACTGA 

TIG LECTIN Amino acid sequence 

AGDVAVQKJCLRSSLNCKSFKWFMTKL\WDLPKFYPPVEPPAAAWGEIRNVGTGLCADTKHG 
GSPLRLEGCVRGRGEAAWNNMQWTFTWREDIRPGDPQHTKKFCFDAISHTSPVTLTO 
40 NQLWKYRKDKTLYHPVSGSCMDCSESDHWFMNTCM>SSLTQQWLFEHTNSTVLEKFNRN* 

GalNAc-Tll lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 492-608 of 
45 GALNTll (GALNTl 1 nucleotide sequence accession number is Y 12434). 

Til LECTIN DNA sequence 

TGCAATATCAGTGAGCGTGTGGAACTGAGAAAGAAGTTGGGCTGTAAATCATTTAAATGGTATT 
TGGATAATGTATACCCAGAGATGCAGATATCTGGGTCCCACGCCAAACCCCAACAACCCATTTT 



{W:\04305\100H154-US1\00085062.DOC IlillDIinililDllillllimilD } 

63 



TGTCAATAGAGGGCCAAAACGACCCAAAGTCCTTCAACGTGGAAGGCTCTATCACCTCCAGACC 
AACAAATGCCTGGTGGCCCAGGGCCGCCCAAGTCAGAAGGGAGGTCTCGTGGTGCTTAAGGCCT 
GTGACTACAGTGACCCAAATCAGATCTGGATCTATAATGAAGAGCATGAATTGGTTTTAAATAG 
TCTCCTTTGTCTAGATATGTCAGAGACTCGCTCATCAGACCCGCCACGGCTCATGAAATGCCAC 
5 GGGTCAGGAGGATCCCAGCAGTGGACCTTTGGGAAAAACT^TCGGCTATACCAGGTGTC^ 

GACAGTGCCTGAGAGCAGTGGATCCCCTGGGTCAGAAGGGCTCTGTCGCCATGGCGATCTGCGA 
TGGCTCCTCTTCACAGCAGTGGCATTTGGAAGGTTAA 

Til LECTIN Amino acid sequence 
10 MSERVELRKKLGCKSFKWYLDNVWEMQISGSHAKPQQPIFX^GPKR^ 

HLQTNKCLVAQGRPSQKGGLWLKACDYSDPNQIWIYNEEHELVLNSLLCLDMSETR^ 

SDPPRLMKCHGSGGSQQWTFGKNNRLYQVSVGQCLRAVDPLGQKGSVAMMCD 

QQWHLEG* 

15 

GalNAc-T12 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 428-581 of 
GALNT12 (GALNT12 nucleotide sequence accession number is AJ505963). 

20 T12 LECTIN DNA sequence 

TGGGATGTGACAGAGAGGAAGCAGCTCCGGGACAAGCTCCAGTGTAAAGACTTCAAGTGGTTCT 
TGGAGACTGTGTATCCAGAACTGCATGTGCCTGAGGACAGGCCTGGCTTCTTCGGGATGCTCCA 
GAACAAAGGACTAACAGACTACTGCTTTGACTATAACCCTCCCGATGAAAACCAGATTGTGGGA 
CACCAGGTCATTCTGTACCTCTGTCATGGGATGGGCCAGAATCAGTTTTTCGAGTACACGTCCC 

25 AGAAAGAAATACGCTATAACACCCACCAGCCTGAGGGCTGCATTGCTGTGGAAGCAGGAATGGA 
TACCCTTATCATGCATCTCTGCGAAGAAACTGCCCCAGAGAATCAGAAGTTCATCTTGCAGGAG 
GATGGATCTTTATTTCACGAACAGTCCAAGAAATGTGTCCAGGCTGCGAGGAAGGAGTCGAGTG 
ACAGTTTCGTTCCACTCTTACGAGACTGCACCAACTCGGATCATCAGAAATGGTTCTTCAAAGA 
GCGCATGTTATGA 

30 

T12 LECTIN Amino acid sequence 

DVTERKQLRDKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQNKGLTDYCFDYNPPDENQIVGH 
QVILYLCHGMGQNQFFEYTSQKEIRYNTHQPEGCIAVEAGMDTLIMHLCEETAPENQKFILQED 
GSLFHEQSKKCVQAARKESSDSFVPLLRDCTNSDHQKWFFKERML* 

35 

GalNAc-T13 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 466-639 of 
GALNT13 (GALNT13 nucleotide sequence accession number is AJ505964). 

40 

T13 LECTIN DNA sequence 

TCTGAGAAGCCAGACTGCATGGAACGCTTGCAGCTGCAAAGGAGACTGGGTTGTCGGACATTCC 
ACTGGTTTCTGGCTAATGTCTACCCTGAGCTGTACCCATCTGAACCCAGGCCCAGTTTCTCTGG 
AAAGCTCCACAACACTGGACTTGGGCTCTGTGCAGACTGCCAGGCAGAAGGGGACATCCTGGGC 
45 TGTCCCATGGTGTTGGCTCCTTGCAGTGACAGCCGGCAGCAACAGTACCTGCAGCACACCAGCA 
GGAAGGAGATTCACTTTGGCAGCCCACAGCACCTGTGCTTTGCTGTCAGGCAGGAGCAGGTGAT 
TCTTCAGAACTGCACGGAGGAAGGCCTGGCCATCCACCAGCAGCACTGGGACTTCCAGGAGAAT 
GGGATGATTGTCCACATTCTTTCTGGGAAATGCATGGAAGCTGTGGTGCAAGAAAACAATAT^G 
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ATTTGTACCTGCGTCCGTGTGATGGAAAAGCCCGCCAGCAGTGGCGTTTTGACCAGATCAATGC 
TGTGGATGAACGATGA 

T13 LECTIN Amino acid sequence 
5 EKPDCMERLQLQRRLGCRTFHWFLANVYPELYPSEPRPSFSGKLHNTGLGLCADCQAEGDILGC 
PMVLAPCSDSRQQQYLQHTSRKEIHFGSPQHLCFAVRQEQVILQNCTEEGLAIHQQHWDFQENG 
MIVHILSGKCMEAWQENNKDLYLRPCDGKARQQWRFDQINAVDER* 



10 GalNAc-T14 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 352-516 of 
GALNT14 (GALNT14 nucleotide sequence accession number is AJ505991). 

T14 LECTIN DNA sequence 

15 TATGGAGATGTGTCAGTCAGAAAAACACTAAGAGAAAATCTGAAGTGTAAGCCCTTTTCTTGGTACCTAGA 
AAACATCTATCCGGACTCCCAGATCCCAAGACGTTATTACTCACTTGGTGAGATAAGAAATGTTGAAACCA 
ATCAGTGTTTAGACAACATGGGCCGCAAGGAAAATGAAAAAGTGGGTATATTCAACTGTCATGGTATGGGA 
GGAAATCAGGTATTTTCTTACACTGCTGACAAAGAAATCCGAACCGATGACTTGTGCTTGGATGTTTCTAG 
ACTCAATGGACCTGTAATCATGTTAAAATGCCACCATATGAGAGGAAATCAGTTATGGGAATATGATGCTG 

20 AGAGACTCACGTTGCGACATGTT7VACAGTAACCAATGTCTCGATGAACCTTCTGAAGAAGACAAAATGGTG 
CCTACAATGCAGGACTGTAGTGGAAGCAGATCCCAACAGTGGCTGCTAAGGAACATGACCTTGGGCACATG 
A 

T14 LECTIN Amino acid sequence 
25 YGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLD^^ 

EKVGIFNCHGMGGNQWSYTADKEIRTDDLCLDVSRLNGPVMLKCHHMRGNQLWEY 
DAERLTLRHVNSNQCLDEPSEEDKMVPTMQDCSGSRSQQWLLRNMTLGT* 



30 GalNAc-T15 lectin domain: 

The lectin domain polypeptide sequence comprises amino acid residues 382-552 of 
GALNT15 (GALNT15 nucleotide sequence accession number is AJ505966). 

T15 LECTIN DNA sequence 

35 TCGGGAATGTTGAGAGCAGATTGGACCTGAGGAAGAATCTGCGCTGCCAGAGCTTCAAGTGGTA 
CCTGGAGAATATCTACCCTGAACTCAGCATCCCCAAGGAGTCCTCCATCCAGAAGGGCAATATC 
CGACAGAGACAGAAGTGCCTGGAATCTCAAAGGCAGAACAACCAAGAAACCCCAAACCTAAAGT 
TGAGCCCCTGTGCCAAGGTCAAAGGCGAAGATGCAAAGTCCCAGGTATGGGCCTTCACATACAC 
CCAGAAGATCCTCCAGGAGGAGCTGTGCCTGTCAGTCATCACCTTGTTCCCTGGCGCCCCAGTG 

40 GTTCTTGTCCTTTGCAAGAATGGAGATGACCGACAGCAATGGACCAAAACTGGTTCCCACATCG 
AGCACATAGCATCCCACCTCTGCCTCGATACAGATATGTTCGGTGATGGCACCGAGAACGGCAA 
GGA/y^TCGGCGTCAACCCATGTGAGTCCTCACTCATGAGCCAGCACTGGGACATGGTGAGTTCT 
TGAG 

45 T15 LECTIN Amino acid sequence 

FGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQKGNIRQRQKCLESQRQNNQETPNLK 
LSPCAKVKGEDAKSQVWAFTYTQKILQEELCLSVITLFPGAPWLVLCKNGDDRQQWTKTGSHI 
EHIASHLCLDTDMFGDGTENGKEIGVNPCESSLMSQHWDMVSS* 
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GaiNAc-T16 lectin domain; 

The lectin domain polypeptide sequence comprises amino acid residues 396-558 of 
GALNT16 (GALNT16 nucleotide sequence accession number is AJ505951). 

5 

T16 LECTIN DNA sequence 

AGTGTGGCTACGCGGATAGAGCAGAGGAAGAAGATGAACTGCAAGTCCTTCCGCTGGTACCT 
GGAGAACGTCTACCCAGAGCTCACGGTCCCCGTGAAGGAAGCACTCCCCGGCATCATTAAGC 
AGGGGGTGAACTGCTTAGAATCTCAGGGCCAGAACACAGCTGGTGACTTCCTGCTTGGAATG 
10 GGGATCTGCAGAGGGTCTGCCAAGAACCCGCAGCCCGCCCAGGCATGGCTGTTCAGTGACCA 
CCTCATCCAGCAGCAGGGGAAGTGCCTGGCTGCCACCTCCACCTTAATGTCCTCCCCTGGATC 
CCCAGTCATACTGCAGATGTGCAACCCTAGAGAAGGCAAGCAGAAATGGAGGAGAAAAGGA 
TCTTTCATCCAGCATTCAGTCAGTGGCCTCTGCCTGGAGACAAAGCCTGCCCAGCTGGTGACC 
AGCAAGTGTCAGGCTGACGCCCAGGCCCAGCAGTGGCAGCTGTTGCCACACACATGA 

15 

T16 LECTIN Amino acid sequence 

SVATRIEQRKKMNCKSFRWYLENVYPELTVPVKEALPGI I KQGVNCLESQGQNTAGDFLL 
GMGICRGSAKNPQPAQAWLFSDHLIQQQGKCLAATSTLMSSPGSPVILQMCNPREGKQKWRRKGSF 
IQHSVSGLCLETKPAQLVTSKCQADAQAQQWQLLPHT* 

20 

In this Example we have defined minimal sequences of functional lectin domains 
based on multiple sequence alignments. It is clear that changes in the length of sequences 
used may not affect functionality of the lectins. Such changes could constitute, for example, 
plus or minus 10-20 amino acid residues of the GalNAc-transferase sequence at their amino 

25 or carboxy termini. . For example, the GalNAc-Tl lectin domain may comprise 10-20 

fewer amino acid residues at its carboxy and/or amino termini than shown in Table Hi's Tl 
lectin domain sequence; i.e. the Tl lectin domain could, for example, stretch from amino 
acids 403-549 of the GALNTl sequence, or, for example, from amino acids 413-539 of the 
GALNTl sequence. Additionally, the GalNAc-Tl lectin domain may comprise 10-20 more 

30 amino acid residues at its carboxy and/or amino termini than shown in Table Ill's Tl 

lection domain sequence; i.e. the Tl lectin domain could, for example, stretch from amino 
acids 383-569 of the GALNTl sequence, or, for example, from amino acids 373-579 of the 
GALNTl sequence. 

Sf9 cells were co-transfected with pACGP67-GalNAc-transferase soluble 

35 expression constructs and Baculo-Gold^*^ DNA (Pharmingen) as previously described 

Briefly, 0.4 \xg DNA was mixed with 0.1 jag Baculo-Gold DNA and co-transfected in Sf9 
cells in 24-well plates. Ninety-six hours post-transfection recombinant virus was ampUfied 
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in 6-well plates at dilutions of 1 : 10 and 1 :50. Titer of amplified virus was estimated by 
titration in 24-well plates. For large scale production and purification of recombinant 
secreted enzymes and lectins the amplified vira were used to infect High Five*^^ cells 
grown in serum fi-ee medium (Invitrogen) in upright roller bottles shaking at 140 rpm in 27 
5 °C waterbaths. Recombinant proteins were purified by nickel NiTA chromatography using 
nickel agarose (Qiagen) as recommended by the manufacturer or by consecutive 
chromatographies on Amberlite, S-sepharose and Mono-S as previously described 

2. Direct binding assay for determination of carbohydrate specificity of 
polypeptide GalNAc-tranferase lectins using soluble GalNAc-transferase enzymes: 

10 GalNAc-transferase lectins were previously shown to direct GalNAc-glycopeptide 

substrate specificities of some GalNAc-transferase ^. The mechanism by which the lectin 
domains mediate this specificity is unknown but the finding that the monosaccharide 
GalNAc selectively inhibits GalNAc-glycopeptide specificity of some isoforais suggested 
that the putative lectin domains were involved in an interaction with the substrate at least 

15 partly through the GalNAc-residue. Nevertheless, it has not been possible in the past 
despite many different attempts to demonstrate direct binding of the enzyme protein or 
fragments hereof to glycopeptides or saccharides ^(PCT WO 01/85215 A2). In this 
Example a binding assay using HIS-tagged affinity purified and biotinylated secreted 
enzyme was developed. HIS-tagged secreted human GalNAc-T2 and -T4 were prepared 

20 from pAC-GP67-T2-sol and pAC-GP67-T4-sol cDNA ^^^^ by PGR as described in 
Example L 

Secreted GalNAc-T2 and -T4 and variant proteins were obtained from infected 
High Five'^^ cells grown in serum- free medium (Invitrogen) in upright roller bottles shaken 
140 rpm in waterbaths at 27 °C. Purification of the recombinant proteins were performed 
25 by iminodiacetic acid metal affinity chromatography (EVLAC) Ni^^-charged (QIAGEN). 
Elution was achieved with 250 mM imidazole in 50 mM sodixmi phosphate (pH 8.0) and 
500 mM NaCl. In some cases, recombinant proteins were purified by consecutive 
ionexchange chromatographies as developed and described previously before final 
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purification by Ni ^-chromatography. Proteins eluted were dialyzed three times against 
PBS (10 mM sodium phosphate (pH 7.4) 150 mM NaCl) and concentrated by centrifugal 
filter device (Millipore; 10,000 kDa cut off). Purity was analyzed by SDS-PAGE under 
reducing conditions, and stained for proteins with Coomassie Blue R 250. 
5 Protein biotinylation was made as previously reported The pH of 1 ml purified 

protein (0.3 mg/ml) in PBS was adjusted to pH 9 with 1 M NaOH and 40 |il N-hydroxy- 
succinimidobiotin (Sigma) dissolved in DMF (10 mg/ml) was added. The solution was 
mixed end-over-end for 2 hours at room temperature, and dialyzed three times against PBS 
and an equal volume of glycerol was added. The biotinylated proteins were stored at -20°C 

10 in 50% glycerol imtil use. 

Glycosylation of MUCl peptides (0.1 mM) was made in 20 mM cacodylate buffer 
(pH 8.0), 10 mM MnCb, 10 mM UDP-GalNAc, and 20 ^g purified polypeptide GalNAc- 
Tl or -T2 with or without subsequent glycosylation with GalNAc-T4 at 37°C during 
overnight. Glycopeptides were purified by C-18 reverse phase HPLC. Peptides were 

15 custom synthesized by Neosystems (Strasbourg). Biotinylated Helix Pomatia lectin (HP A) 
was from KemEnTec (Denmark). Anti-MUCl HMFG2 monoclonal antibody was a 
generous gift from Joyce Taylor-Papadimitriou. Anti-MUCl 5E5 monoclonal antibody was 
developed by immunizing Balb/c mice with 60-mer MUCl tandem repeat peptide 
glycosylated with 5 moles GalNAc per repeat. Monoclonal antibodies to the lectin domains 

20 of human GalNAc-T2 and -T4 were developed as previously described 

Direct binding ELISA assay was developed as follows: Polystyrene microtiter 
plates (Maxisorb, Nunc, Denmark) were coated with peptides or enzymatically 
glycosylated glycopeptides in PBS overnight at 4*'C. Plates were washed and blocked with 
0.1% Tween20 and 0.2% BSA in PBS for 1 h at room temperature, followed by incubation 

25 with biotinylated proteins in PBS with 0.05% Tween20 for 2 h at room temperature. After 
four washes with PBS, plates were incubated with 1 :2,000 dilution streptavidin-HRP 
(Sigma) in PBS for 30 min at room temperature and washed four times with PBS. 
Development was performed with 0.5 mg/ml o-phenylenediamine and 0.02% H2O2 at room 
temperature for 15 min, and reaction stopped by adding 100 ^I/well of 0.5 N H2SO4. 
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Competitive inhibition assays were done at end-point titers of GalNAc-transferase proteins 
with one hour preincubations with inhibitor. 

In initial binding experiments it was determined that secreted GalNAc-T2 and -T4 
could bind their peptides substrates in the presence of 5 mM UDP and Mn"*^ (may be 
5 substituted with other divalent cation), whereas no binding was observed in the absence or 
when 10 mM EDTA was included. This correlates with our previous findings that GalNAc- 
T2 can be purified on an acceptor substrate peptide in the presence of UDP and Mn"*^, and 
can be eluted by removing UDP in EDTA This binding is mediated by the catalytic unit 
of the enzyme, which exposes the acceptor-binding pocket only in the presence of UDP and 
1 0 Mn"^, predicted by the ordered catalytic reaction. 

In order to selectively evaluate the binding characteristics of the putative lectin 
domain, assays were carried out in the absence of UDP and Mn^. Significant binding to 
GalNAc-peptides was found for both GaIN[Ac-T2 and -T4 (Figure 4). 

We have previously shown that GalNAc-T4 with a single amino acid change in the 
15 lectin domain selectively impairs the GalNAc-glycopeptide substrate specificity of this 
enzyme ^. In agreement with this the lectin mutated enzyme protein did not bind GalNAc- 
glycopeptides. The binding to GalNAc-glycopeptides were unaffected by Ca++ and EDTA 
fiirther confirming that the catalytic units of the enzyme proteins are not involve in binding. 

GalNAc and GalNAca-benzyl have previously been shown to inhibit the GaDSfAc- 
20 glycopeptide substrate specificity of GalNAc-T4 ^ (PCT WO 01/85215 A2). Lectins and 
antibodies to carbohydrates usually recognize the anomeric configuration of the sugar 
structures they bind. However, surprisingly, both GalNAc-T2 and -T4 exhibit equal 
inhibition with GalNAca-benzyl (Sigma) and GalNAc^-benzyl (NuRx, Alberta Research 
Council (Fig. 5).^^' Similar results were obtained with other aryl derivatives. 
25 The methods described in this Example utilize recombinant polypeptide GalNAc- 

transferases in binding assays and excludes potential binding activity through the catalytic 
imit. It is clear that recombinant polypeptide GaMAc-transferases with mutations that 
inactivates the binding activity of the catalytic unit can be used, as well as any truncation of 
the enzyme protein that eliminate the binding activity of the catalytic unit. 
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While this binding assay establishes a method for screening for inhibitors of lection 
mediated binding mediated through the lectins of human GalNAc-T2 and GalNAc-T4, it is 
clear that the same method with modifications can be appUed to all animal and mammalian 
polypeptide GalNAc-transferases with a functional lectin domain. The ligand target used in 
5 this Example is a GalNAc-MUCl glycopeptide produced enzymatically firom synthetic 
peptides. It is clear that GalNAc-glycopeptides based on any number of peptides with 
GalNAc attached can be used as target for the binding assay. It is also clear that the assay 
developed can be modified to accommodate high through put screening by any assay 
method available in the prior art that can detect and quantify binding between the 
10 polypeptide GalNAc-tranferase mediated through its lectin domain and a suitable ligand. 

3. Direct binding assay for determination of carbohydrate specificity of 
polypeptide GalNAc-tranferase lectins using truncated GalNAc-transferase lectin 
domains: 

In Example 2 it was established that secreted soluble enzymatically active proteins 
15 of GalNAc-T2 and -T4 bind GalNAc-MUCl glycopeptides, and that GalNAc could inhibit 
the binding. The catalytic unit of polypeptide GalNAc-transferase can interact and bind 
acceptor substrate peptides and possible glycopeptides however, binding studies without 
donor substrates (UDP), and in the presence of EDTA to chelate Mn^^, suggested that the 
binding was not mediated through the catalytic unit. In this Example direct binding to 
20 GalNAc-glycopeptides through the lectin domains of polypeptide GalNAc-transferases 
GalNAc-T2 and -T4 was established. Attempts to express C-terminal truncated GalNAc- 
transferase proteins failed due to low secretion rate presumable related to folding problems 
and intracellular degradation. Similar phenomenon has recently been reported for GalNAc- 
Tl Numerous attempts to express isolated lectin domains in insect cells and P.pastoris 
25 have failed due to low expression and apparent degradation. As described in Example 1 
successful expression was finally achieved with constmcts truncated as described in Table 
n using an expression vector with a N-terminal HIS tag and thrombin cleavage site as well 
as a T7 tag. HIS-tagged truncated GalNAc-T2 and -T4 lectins were expressed and purified 
as described in Example 2. Lectins were used in binding studies directly or after 
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biotinylation as described in Example 2. In binding studies using lectins without 
biotinylation antibodies to the HIS-tag and the T7-tag, or in some experiments antibodies 
raised to GalNAc-T2 and -T4 enzymes were used to detect binding. 

Inhibition experiments were used to further define the binding specificity of 
GalNAc-T2 and -T4 secreted soluble enzymes as well as lectin domains compared to Helix 
Pomatia lectin (Table IV). GalNAca- and GalNAcp-aryl structures inhibited binding of 
both enzymes and isolated lectins at comparable levels. In contrast, Helix Pomatia showed 
strong preference for GalNAca-derivatives. Gal and other sugars had no inhibitory effect. 
Interestingly, UDP-GalNAc was not a significant inhibitor of the GalNAc-transferase lectin 
binding, but a strong inhibitor of Helix Pomatia binding. 
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Table IV. Inhibition of carbohydrate-binding proteins by using saccharides and related 
structures. 



Carbohydrates 


T21d 


sT2 


T41d 


sT4 


HPA 


Glc 


>100" 


>100 


>100 

X \J\J 


>100 

^ X \J\J 


>100 

^ X \J\J 


GlcNAc 


>100 


>100 


>100 

^ I \J\J 


>ion 


O 


BzlaGlcNAc 


>100 

L \J\J 


>100 


>100 

X \J\J 


>100 


2 


BzlBGlcNAc 


>100 


>100 


>100 

— X \J\J 


>100 


>20 


Gal 


>100 


>100 


>100 


50 


>100 


MeaGal 


>100 


>100 


>100 


50 


>100 


MeBGal 


>100 


>100 


>100 


40 


>100 


GalNAc 


37 


15 


5 


1 


2 


BzlaGalNAc 


20 


15 


5 


1 


0.5 








ij 


1 
I 


1 
1 


oNPaGalNAc 


12 


7 


5 


0.5 


1.5 


oNPpGalNAc 


>12 


10 


8 


1 


>20 


pNPaGalNAc 


>10 


>10 


10 


1 


2 


pNPpOalNAc 


>10 


>10 


8 


0.8 


>20 


UDPaGalNAc 


>100 


65 


50 


30 


2 


UDP 


>100 


>100 


>100 


>100 


>100 


Lactose 


>100 


>100 


>100 


>100 


ND 


EDTA 


>10 


>10 


>10 


>10 


>10 



^ Concentration (mM) required for 50% inhibition (IC50). 
ND, not determined. 
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While this binding assay estabhshes a method for screening for inhibitors of 
isolated human GalNAc-T2 and GalNAc-T4 lections, it is clear that the same method with 
modifications can be applied to all animal and mammalian polypeptide GalNAc-transferase 
lectins. The ligand target used in this Example is a GalNAc-MUCl glycopeptide produced 

5 enzymatically fi"om synthetic peptides. It is clear that GalNAc-glycopeptides based on any 
number of peptides with GalNAc attached can be used as target for the binding assay. It is 
also clear that the assay developed can be modified to accommodate high through put 
screening by any assay method available in the prior art that can detect and quantify 
binding between the isolated lectin and a suitable Ugand. 

10 The methods described in this Example utilize recombinant GalNAc-transferase 

lectins in binding assays which excludes potential binding activity through the catalytic 
unit. It is clear that recombinant polypeptide GaBsf Ac-transferases with mutations that 
inactivates the binding activity of the catalytic vmit can be used, as well as any truncation of 
the enzyme protein that eliminate the binding activity of the catalytic unit. 

15 4. Establishment of cell line model systems for cell surface expression of mucin 

and secreted mucin - Stably transfected CHO and CHO IdlD cells: 

Cell lines and expression constructs: Wild type Chinese Hamster Ovary cells 
(CHO) and the glycosylation deficient mutant cell line CHO IdlD were stably transfected 
with a fiiU coding MUCl construct (MUCIF, suppHed by M.A. HolUngsworth, Nebraska, 

20 USA) containing 32 tandem repeats using the pCDNAS vector (Invitrogen). A secreted 
MUCl construct (MUCl-IgGHIS) was generate by insertion of mouse IgGy2a domain 
fixsed to 6x histidine tag at the BsU36I site dovmstream of the tandem repeat region of 
MUCIF Cells were generally grown in Hams F12 containing 10% Fetal Bovine Serum 
at C at 5% CO2, and plated 12-24 hours prior to transfection in 6 well plates and grown 

25 to approximately 50% confluency. One hour before transfection cells were washed m 
serum firee medium Optimem (Invitrogen) and cells were transfected with 1-2 jag DNA 
using the Lipofectamine plus reagent (Invitrogen) in a total volume of 1 mL as 
recommended by the supplier. Three hours after the transfection one mL of Hams F12 
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containing 10% Fetal Bovine Serum was added and cells grown 24-48 hours before 
medium was rq^laced with 2 mL Hams F12 containing 10% Fetal Bovine Serum. Two to 
three days after transfection cells were trypzinized and plated in 75 mL T-flasks or in 24/96 
well microtiter plates in the same medium containing the appropriate selection agent (1 
5 mg/mL G41 8 or 0.4 mg/mL Zeocin). Selection medium was changed twice weekly until 
clones appeared. The medium used for CHO IdlD cells included 1 mM GalNAc and 0.1 
mM Gal. Transfectant clones were selected by immunocytology with anti-MUCl 
antibodies and SDS-PAGE western blot analysis to demonstrate cell surface expression and 
secretion of MUC 1 . 

10 Immunocytology: Two different procedures were applied: i) For general screening 

purposes, cells grown in plates or flasks were trypsinized, washed in saline, and airdried on 
multiwell coverslides. Slides were fixed in ice-cold acetone and stained with monoclonal 
antibodies and FITC-conjugated rabbit anti-mouse Ig as previously described ii) For 
analysis of cell surface expression, cells were seeded in 6 well plates and grown for 6 hours 

15 in Hams F12 medium with serum until approximately 30-50 % subconfluent. Medium was 
hereafter replaced with Optimem supplemented with 1.0 mM GalNAc and 0.1 mM Gal and 
cells grown for 18 to 42 hours. Cells were washed once in PBS (phosphate buffered saline 
without Calcium and Magnesiimi) and subsequently fixed in 2 ml 3% paraformaldehyde at 
25°C for 20 min followed by three washes with PBS. Free aldehyde groups were quenched 

20 by incubating in 2 mL 50 mM Ammonium Chloride in PBS for 1 0 min, followed by three 
washes with PBS and three washes with 5 min incubations each with PBS containing 0.2 % 
Fish Skin Gelatin (Sigma). Immunostaining of cells was performed by incubation with 
monoclonal antibodies for 40 min at 25°C, followed by three washes with PBS and three 
washes of 5 min each with PBS containing 0.2 % Fish Skin Gelatin. Subsequently, cells 

25 were incubated with FITC-conjugated rabbit anti-mouse Ig (Dako, F261) diluted 1 : 1 50 in 
PBS containing 0.2 % Fish Skin Gelatin) for 20 min at 25°C, followed by the same 
washing procedure, after which wells were cut out of plates and mounted with glycerol as 
for glass slides. 
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Characterization of wild type and MUCl stable transfectant CHO cells: Several 
representative clones expressing the full coding or secreted MUCl construct were selected 
and characterized for expression of MUCl as well as 0-glycosylation. Wild type 
CHO/MUClF-clonel expressed MUCl at the cell surface as detected by anti-MUCl 
5 monoclonal antibodies on non-permeabilized cells, while wild type CHO/MUCl sol- 
cloneC4 only was labeled weakly at the surface. Staining with a panel of anti-MUCl 
antibodies of permeabilized cells showed intracellular accumulation of MUCl detected by 
HMFG2 (general anti-MUCl reactive, ^^), SM3 (reactive with cancer-associated MUCl, 
^\ VU-4H5 (reactive with low density 0-glycosylated MUCl, ^\ VU-2G7 (reactive with 

10 high density 0-glycosylated MUCl, ^\ and a novel antibody 5E5 reactive exclusively 
with STn/Tn-glycosylated MUCl glycoforms. In contrast, staining of non-permeabilized 
cells were only reactive with the anti-MUCl antibodies HMFG2, SMS and weakly VU- 
2G7. Analysis of O-glycosylation using a panel of anti-carbohydrate monoclonal antibodies 
revealed that wild type CHO cells label very weakly with anti-T antibodies (HH8, 3C9, 

15 at the surface after neuraminidase treatment, while untreated cells are negative indicating 
that wild type CHO cells express very little 0-glycoproteins and the glycosylation is 
mainly of sialylated core 1 structure (ST) (Fig. 6). Antibodies to Tn (1E3, 5F4, were 
weakly reactive without and with neuraminidase treatment and antibodies to STn (TKH2, 
3F1, were negative. Staining with the lectins PNA (T), HPA (Tn), SNA (a2,6sialic acid) 

20 and MAA (a2,3sialic acid) were in agreement except the finding of weak reactivity with 
SNA indicating some presence of a2,6 linked sialic acids which may be derived from N- 
linked or 0-linked glycans. These results demonstrate that the main form of O- 
glycosylation found on MUCl expressed in wild type CHO is the sialyl-T structure as 
foimd for other recombinant glycoproteins 

25 Staining of permeabilized wild type CHO/MUC 1 sol-clone-C4 with anti-MUC 1 

antibodies revealed strong intracellular expression of MUCl with HMFG2, SM3, vu-4H5, 
and VU-2G7 (Fig. 6). Staining with anti-carbohydrate antibodies revealed strong 
intracellular staining with anti-T after neuraminidase only, while anti-Tn only labeled 
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weakly. These results indicate that the main glycosylation of MUCl in CHO wild type is 
ST similar to untransfected cells. 

In order to characterize the secreted MUCl product SDS-PAGE western blot 
analysis of harvested culture medium of confluent cultures were performed. Ten to twenty 
5 jiL culture supematant was analysed directly or treated with 0. 1 U/mL neuraminidase (C. 
Perfringes VI, Sigma) for 30-60 min at 37°C. Samples were mixed with SDS sample 
buffer, reduced with DTT, and run on precast 4-20 % gels (Biorad). As shown in Figure 7 
anti-MUCl antibodies detected two forms of MUCl in the mediimi; a low molecular 
weight form migrating as 130-140 kd corresponding to unglycosylated product, and a high 

10 molecular weight form migrating above 250 kd. The high molecular weight form was 
sensitive to neuraminidase treatment as evidenced by a marked shift and retardation in 
migration. It is known that sialylated glyoproteins migrate aberrantly and often 
desialylation results in slower migration by SDS-PAGE analysis regardless of the mass. 
Interestingly, the antibody VU-4H5 reacted mainly with the unglycosylated form and only 

15 a very weak band was found in the high molecular weight forms after neuraminidase 
treatment. This result indicates that the PDTR region is 0-glycosylated as the VU-4H5 
antibody was previously found to tolerate 0-glycosylation most positions in the tandem 
repeat except the PDTR region In agreement with this the antibody VU-2G7 raised 
against a MUCl GalNAc-glycopeptide with only one GalNAc per repeat attached in the 

20 PDTR region reacted strongly with the secreted MUCl . Furthermore, reactivity with the 
anti-T antibody after neuraminidase treatment showed that the main type of O- 
glycosylation on secreted MUCl was sialylated-T. Anti-Tn and STn produced no staining. 

Characterization of mutant and MUCl stable transfectant CHO IdlD cells: CHO 
IdlD cells stably transfected with fiiU coding MUCl, e.g. CH01dlD/MUClF-clone2, 

25 expressed MUCl at the cell surface as detected by anti-MUCl antibodies when cells were 
grown in GalNAc and Gal (Fig. 8). Cells were seeded at approximately 30-50 % 
confluency (approx. 0.2 X 10^ per 6 well plate) in Hams F12 medium supplemented with 
10% Fetal Bovine serum and grovra for 6 hours. Medium was replaced with Optimem with 
or without 1.0 mM GalNAc and/or 0.1 mM Gal, and cells grown for 18-36 hours after 
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which cells were trypsinised and washed in saline and processed as described for 
immunocytology. CH01dlD/MUClF-clone2 cells grown in the absence of sugars and 
analysed after permeabilization produced very little MUCl detectable by HMFG2 but not 
by 5E5. In contrast, cells grown in the presence of only GalNAc strongly expressed MUCl 
5 as evaluated by HMFG2 and 5E5, specifically reactive with GalNAc-MUCl . In agreement 
with reactivity with 5E5 these cells also labeled strongly with anti-Tn antibodies, 5F4 and 
1E3, while anti-T antibodies, HH8 and 3C9, did not label the cells. Very weak or no 
staining with anti-STn antibodies (3F1 and TKH2) indicates that a2,6 sialylation to form 
STn does not occur in CHO IdlD cells. CH01dlDMUClF-clone2 cells grown in the 

10 presence of both GalNAc and Gal show reactivity at the surface with anti-T antibodies 

(HH8 and 3C9) only after neuraminidase pretreatment, confirming previous results that the 
predominiant glycoform in CHO cells is sialyl-T (Fig 8). No staining with anti-Tn or STn 
antibodies was detected with cells grown in both Gal and GalNAc. CHOldUD/MUClF- 
clone2 cells grown in the absence of GalNAc and Gal or in the presence of GalNAc alone 

15 showed no reactivity with anti-Tn and T antibodies or lectins (DBA, HP A, WA, PNA, not 
shown) were detected indicating complete lack of 0-glycosylation (Fig 8). Cell surface 
expression of MUCl was detected in CH01dlD/MUClF-clone2 cells grown in the 
presence of GaMAc, while cells grown without GalNAc showed no or only weak 
expression of MUCl at the surface (Fig 8). Surface expression of MUCl was detected with 

20 HMFG2 in cells grown in GaMAc as well as cells grown in both Gal and GalNAc, 

however, expression analysed with the Tn/STn-MUCl glycoform specific antibody 5E5 
revealed surface expression only with cells grown in the presence of GalNAc (Fig 8). This 
latter finding is in agreement with the 0-glycosylation pattem determined above in these 
cells. 

25 CHOldlD/MUCsol-cloneDS secretes MUCl to the culture medium, and 

permeabilized cells immunostain with antibodies to MUCl in the cytoplasm. Cells were 
grown in Hams F12 medium supplemented with 10% Fetal Bovine serum and seeded at a 
density of 0.2 x 10^ in 6 well plates. Following growth for 6 hours, the medium was 
replaced with Optimem supplemented with 1 mM GalNAc, 0.1 mM Gal, or 1 mM GalNAc 
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and 0.1 mM Gal, and cells grown for 18-72 hours. Secretion of MUCl was monitored by 
immunochemical assays of culture supematants at differing time points. SDS-PAGE 
western blot analysis was performed with 5 ^1 culture supernatant mixed with 5 jil of 2x 
SDS sample buffer containing 1 mM DTT. Samples were heated to 100**C for 2 min and 
5 loaded on a precast 4-20% gradient gel and run at 125 V for 75 min. Transfer to 

nitrocellulose membrane was performed by elecroblotting using Biorad Mini Trans Blot 
apparatus at 350 mA for 1 hour. Membranes were blocked with 1^% skimmed milk 
prepared in dH20 for 2 hours and stained with anti-MUCl and anti-carbohydrate 
monoclonal antibodies for 18 hours at 4 °C, followed by washing with Tris buffered saline 

10 (TBS) (10 mM Tris pH 8,0 with 8.5% NaCl) 5 times for 5 min, and incubation with with 
biotinylated rabbit anti-mouse IgG subclass specific antibodies (1 : 1000 dilution in TBS) 
for 1 hour at 25°C. Following 5 washes for 5 min each in TBS, membranes were incubated 
in HRP conjugated Streptavidin (1 : 3000 dilution in TBS) for 30 min at 25°C. After 5 
washings of 5 min each in TBS the blot was developed in 0.04 % 4-Chloro-l-Naphthol 

15 prepared in 50 mM Tris-HCl (pH 7,4) containing 0.025 % H2O2. Similar to the findings 
with full coding MUCl expressed at the cell surface of CHO IdlD cells, glycosylation of 
the secreted MUCl was dependent on Gal and GalNAc sugars in culture medium. Cells 
grown without sugars produced and secreted low amounts of a low molecular weight 
MUCl molecule of apparent mw of 120-130 kd without glycosylation detectable by 

20 HMFG2 but not 5E5 or anti-Tn and anti-T antibodies (Fig. 9). In contrast, cells grown in 1 
mM GalNAc secreted MUCl glycosylated with GalNAc (Tn) as evidenced by reactivity 
with both HMFG2 and 5E5 as well as anti-Tn antibodies (Fig. 9). The apparent molecular 
weight of secreted Tn-MUCl was 250-300 kd and no significant shift in migration was 
observed with pretreatment with neuraminidase (0.1 U/ml for 30 min at 37°C), suggesting 

25 lack of a2,6sialylation (STn). This was confirmed by lack of staining with anti-STn 
antibodies. Cells grown in both 0.1 mM Gal and 1 mM GalNAc produced and secreted 
MUCl v^th sialylated core 1 (T) glycoforms reactive with HMFG2 but not 5E5 (Fig. 9). 
Pretreatment with neuraminidase resulted in a significant shift in migration and reactivity 
with anti-T antibodies as well as the lectin PNA. Two novel anti-MUCl antibodies 
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described recently have been suggested to react with the MUCl tandem repeat sequence 
without (Mab VU-4H5) or with (Mab VU-2G7) 0-glycans attached in the central 
immunodominant epitope -PDTR-. Analysis of secreted MUCl produced in CHO IdlD 
cells grown without GaMAc show reactivity with unglycosylated MUCl migrating at mw 

5 1 20- 1 30 kd with VU-4H5, while no or only weak reactivity was observed when grown in 
GalNAc with or without Gal (Fig. 10). In contrast, the Mab VU-2G7 reacted strongly with 
MUCl migrating at 250-300 kd secreted from cells grown in GalNAc with or without Gal 
(Fig. 10). Although, Mab VU-2G7 reacted weakly with unglycosylated MUCl the 
combined results suggest that MUCl produced in CHO IdlD cells carry 0-glycans on all 

10 five sites of the tandem repeat. 

5. The inhibitor GalNAca-benzyl inhibits MUCl expression independently of O- 
glycan processing. 

As shown in Example 4 CH01dlD/MUClF-clone2 cells grown in the presence of 
GalNAc but not Gal have limited 0-glycosylation capacity, only produce the Tn glycoform 

15 of MUCl, but expresses comparable levels of MUCl at the cell surface as in wild type 
CHO cells or in CHO IdlD cells grown in both GalNAc and Gal. This suggested that cell 
surface expression was not related to 0-glycosylation and particular glycoforms as 
previously proposed (for a review see Huet). We therefore investigated the effect of 
treatment with GalNAca-benzyl of CH01dtD/MUClF-clone2 cells grown in the presence 

20 of GalNAc. CH01dlD/MUClF-clone2 cells were seeded in 6 well plates at a density of 0.2 
X 10^ per well and were grown for 6 hours in Hams F12 medium with serum until 
approximately 30% subconfluent. Medium was hereafter replaced with Optimem 
supplemented with 1.0 mM GalNAc or 1.0 mM GalNAc and O.lmM Gal with or without 
the inhibitors GalNAca-benzyl, GalNAcp-benzyl or the control GlcNAca-benzyl. After 18 

25 hours, the medium was replaced with fresh Optimem containing the sugars and benzyl 

derivatives as above and grown for 12-48 hours. Initially we analyzed surface expression of 
MUCl by immunocytology. Cells were washed once in PBS-CMF (phosphate buffered 
saline Calcium and Magnesium free) after carefully removing the medium from the wells 
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and subsequently fixed in 2 ml 3% paraformaldehyde at 25°C for 20 min followed by three 
washes with PBS-CMF 3 times. Free aldehyde groups were quenched by incubating in 2 
mL 50 mM Ammonium Chloride in PBS-CMF for 10 min, followed by three washes with 
PBS-CMF and three washes with 5 min incubations each with PBS-CMF containing 0.2 % 
5 Fish Skin Gelatin (Sigma). Immunostaining of cells was performed by incubation with 
monoclonal antibodies for 40 min at 25°C, followed by three washes with PBS-CMF and 
three washes of 5 min each with PBS-CMF containing 0.2 % Fish Skin Gelatin. 
Subsequently, cells were incubated with FITC-conjugated rabbit anti-mouse Ig (Dako 
F261)diluted 1: 150 in PBS-CMF containing 0.2 % Fish Skin Gelatin) for 20 min at 25"^^ 

10 followed by the same washing procedure, after which wells were cut out of plates and 
mounted with glycerol as for glass slides. Figure 1 1 shows that treatment with 1 mM 
GalNAca-benzyl, produced strong inhibition of cell surface expression of MUCl, while 
treatment with a similar control benzyl derivative showed no inhibition. GlcNAca-benzyl 
was chosen as a control because this sugar does not serve as a substrate for mammalian 

15 glycosyltransferases and hence was not expected to interfere with O-glycosylation in CHO 
cells. Most anti-MUCl antibodies reacted with cells grown in GalNAc including VU-2G7 
and 5E5, and only VU-4H5 did not react. Reactivity with VU-2G7 indicates that the - 
PDTR- region is 0-glycosylated, while reactivity with 5E5 confirms that the glycoforms of 
surface MUCl is mainly or exclusively Tn. 

20 We next analysed the expression of MUCl produced by CH01dlD/MUClF-clone2 

cells by SDS-PAGE westem analysis. Cells were grown for 24 hours or 48 hours in the 
presence of 1 mM GalNAc or 1 mM GalNAc and 0.1 mM Gal to limit core O- 
glycosylation to GalNAcal-O-Ser/Thr and Galpl-3GalNAcal-0-Ser/Thr, respectively. 
Cells were fiulher treated with 2 mM GalNAca-benzyl, GlcNAca-benzyl or no inhibitor. 

25 Cells were washed and lysed at 24 or 48 hours and the lysates subjected to 

immunoprecipitation with monoclonal antibody HMFG2, which broadly recognize MUCl 
glycoforms. Immunoprecipitates were analysed by SDS-PAGE and westem blot using 
HMFG2 antibody to detect MUCl expression. As shown in Figure 12 the MUCl 
glycoforms at 24 hours expressed by cells grown in GalNAc or Gal and GaESTAc migrated 
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similarly with only little high molecular weight forms in both, indicating that synthesis of 
sialylated core 1 0-glycans were time limited. At 48 hours, MUCl glycoforms migrating 
as higher molecular weight species were expressed more pronounced and selectively by 
cells grown in Gal and GalNAc. Treatment with GlcNAca-benzyl produced the same 

5 glycoforms at similar intensity as cells without treatment. In striking contrast, treatment 
with GalNAca-benzyl had significant effect after 48 hours. A significant reduction in 
MUCl expression was found in cells grown in GalNAc as well as in Gal and GalNAc. In 
the latter case a significant shift in migration into two bands fiirther confirmed that 
GalNAca-benzyl also serves as an inhibitor of 0-glycan extension and reduces O- 

10 glycosylation to GalNAcal-O-Ser/Thr. Analysis of the same blots with the anti-MUCl 
antibody 5E5 (Figure 13) produced essentially the same results except that the antibody 
only labeled the lower migrating band of the two bands labeled by HMFG2 in cells grown 
in Gal and GalNAc and treated with GalNAca-benzyl. This indicates some heterogeneity 
in glycosylation. 

15 These results show for the first time that the effect of GalNAca-benzyl on mucin 

transport and surface expression is independent of its effects on O-glycosylation in striking 
contrast to the prevailing hypothesis Because cells grown in the presence of GalNAc do 
not produce core 1 (Gaipi-3GalNAcal-0-Ser/Thr) O-glycosylation, GalNAca-benzyl 
cannot serve as a competitive substrate for the core 1 p3galactosyltransferase and 

20 subsequently for sialyltransferases. GalNAca-benzyl must therefore exert its fimction on 
mucin transport by another unknown mechanism. 

The in vivo cell line model system developed is one example of a method to screen 
for inhibitors effects of one or more compounds on transport of mucins and 0-linked 
glycoproteins in cells. The Example utilizes MUCl but any mucin or 0-linked glycoprotein 

25 could be used with appropriate expression constructs, antibodies and reagents. The 

developed cell model and modifications hereof can be used for high throughput screens of 
inhibitors in combination with or as a second screen after the binding assays disclosed in 
Examples 2 and 3. 
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6. Identification of a novel selective inhibitor, GalNAcP-benzyl, of polypeptide 
GalNAc-transferase lectins that inhibits MUCl expression without affecting O- 
glycosylation. 

As shown in Examples 2 and 3, polypeptide GalNAc-transferases contain lectin 
5 domains with binding properties for GalNAc-peptides including GalNAc-MUCl peptides. 
Since GalNAca-benzyl was found to inhibit the binding properties of GalNAc-transferase 
lectins, we tested the possibility that the independent effect on mucin expression this O- 
glycosylation inhibitor has, could be related to an inhibitory effect on GalNAc-transferase 
lectins. In Examples 2 and 3 we found surprisingly that the lectin domains of several 

10 GalNAc-transferases in addition to GalNAca-benzyl, which mimics the GalNAc- 

glycopeptide targets of the lectins, also were inhibited by pGalNAc derivatives. Initial tests 
with commercially available GalNAcP and GalNAca derivatives, p-nitrophenyl and 
umbreUiferyl did not produce significant effects in our model system. GalNAcP-benzyl, the 
jS-anomeric configuration of GalNAc-benzyl (there is a jS linkage between the N- 

15 acetylgalactosamine and the benzyl ring), was custom synthesized by Alberta Research 
Council (Canada), and its structure was confimied by mass spectrometry and ^H-NMR. 
CH01dlD/MUClF-clone2 cells were grown for 12 hours in the presence of 1 mM GalNAc 
or 1 mM GalNAc and 0.1 mM Gal to limit core 0-glycosylation to GalNAca 1-0-Ser/Thr 
and Gaipi-3GalNAcal-0-Ser/Thr, respectively. Cells were then treated with 2 mM 

20 GalNAca-benzyl, GalNAcp-benzyl, or GlcNAca-benzyl as control (GlcNAca-benzyl was 
shown in Example 4 to have no effect). Cells were washed and lysed as described above 
after 36 hours and the lysates subjected to immunoprecipitation with anti-MUCl 
monoclonal antibodies HMFG2 or 5E5. Figure 14 illustrates that treatment with GalNAcp- 
benzyl produced the same or better reduction in MUCl expression as treatment with 

25 GalNAca-benzyl in cells grown in GalNAc as well as in Gal and GalNAc. In cells grown 
in Gal and GalNAc MUCl expression was reduced with GalNAcP-benzyl treatment, but in 
contrast to cells treated with GalNAca-benzyl, GalNAcp-benzyl produced no change in the 
migration of MUCl demonstrating that this inhibitor does not affect the 0-glycan 
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processing. The lack of immimoprecipitation of MUCl by antibody 5E5 in cells grown in 
Gal and GalNAc indicates that MUCl, is glycosylated with more complex structures than 
GalNAcal-O-Ser/Thr as recognized by this antibody. As shown in Figure 16 the main O- 
glycan phenotype of CHO IdlD cells grown in Gal and GalNAc is sialylated-T, and 5E5 
5 does not react with MUCl, with T or sialylated T glycoforms of MUCl. Figure 15 

illustrates the same experiment as in Figure 14 except that the detection antibody is 5E5 
and only Tn and STn MUCl glycoforms are visualised. This experiment confirms the 
strong inhibition of MUCl expression in GalNAca-benzyl and GalNAcP-benzyl treated 
cells. 

10 The finding that GalNAca-benzyl and GalNAcp-benzyl exhibit the same inhibitory 

effect on GalNAc-transferase lectin binding, and that they have similar effects on inhibition 
of MUCl expression, clearly indicate that the effects these compounds have on mucin 
expression and secretion are directed through interaction with the lectin domains of 
polypeptide GalNAc-transferases. The effect GalNAca-benzyl has on 0-glycan processing 

15 is a separate phenomenon directed by its ability to serve as a competitive substrate for the 
core 1 pl,3galactosyltransferase. 

GalNAcP-benzyl is the first identified selective inhibitor of polypeptide GalNAc- 
transferase lectins and their roles in transport and secretion, which does not modulate O- 
glycosylation in cells (i.e. does not serve a substrate for mainly core 1 

20 pSgalactosyltransferase activities, a2,6sialyltransferase activities, and core 3 pSGlcNAc- 
transferase activities). Inhibitors structurally related to GalNAcP-benzyl with the same 
properties, as identifiable by the binding assays disclosed in Examples 2 and 3, may be 
designed and syntesized to obtain higher affinity binders. Such inhibitors may be based on 
carbohydrates such as the monosaccharide GalNAc or modifications thereof, inhibitors 

25 may be based on structural and functional mimetics such as polypeptides, glycopeptides, 
DNA, RNA, antibodies, and antibody fi*agments including phage antibodies, and inhibitors 
may be natural or synthetic organic or inorganic compounds. One common feature for such 
preferred inhibitors is the ability to inhibit the binding of one or more polypeptide GalNAc- 
transferase lectins to its binding ligand, such as GalNAc-glycopeptides and mucins as 
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exemplified in Examples 2 and 3. Another feature of the novel inhibitor GalNAcP-benzyl is 
its ability to enter living cells and reach the Golgi apparatus for in vivo binding to 
polypeptide GalNAc-transferase. The hydrophic benzyl aglycone is one example of an aryl 
compound suitable for the jS-anomeric configuration of GalNAc-R , but other aryl 
5 substituents include, without limitation, p-nitrophenyl, umbrelliferyl, and 

naphtalenmethanol. Any pharmaceutical carrier known in the art may be used to achieve 
the same effect. The appropriate carrier will be evident to those skilled in the art and will 
depend in large part upon the route of administration. 

7. Inhibition of secretion of mucins: 

10 Because GalNAca-benzyl exerts separate effects on 0-glycan processing and 

mucin expression, the use of the novel selective inhibitor of mucin expression, GalNAcp- 
benzyl, allow analysis of mucin expression and secretion in different cell line models. 
Examples of human cell lines (available fi-om ATCC, USA) expressing and secreting 
mucins are without limitations LS174T, HT29, Colo205, CALU, MCF7, T47D, NCI- 

15 H292, and A549. Most human adenocarcinoma cell lines express and secret mucins and 
analysis with antibodies to detect protein expression or probes to detect mRNA can reveal 
the types and quantities of mucins. The human colon carcinoma cell line LS174T was 
previously shown to exhibit reduced secretion of mucin following treatment v^th 
GalNAca-benzyl This and other cell lines can be used to treat with 2 mM GalNAcp- 

20 benzyl. Li this Example we used wild type CHO/MUClsol-cloneC4 and western blot 
analysis of medium of cells treated with 1-2 mM GalNAca-benzyl, GalNAcfJ-benzyl, or 
the control GlcNAca-benzyl. Treatment with both GalNAca-benzyl and GalNAcp-benzyl 
showed inhibition of secreted MUCl compared to control treated or non-treated cells. The 
inhibitory effect on mucin secretion can be quantified by a number of assays known to the 

25 skilled in the art including western blot, ELISA, gelfiltration, immunocapture, and other 
assays. 

The in vivo cell line model system developed is one example of a method to screen 
for inhibitors effects of one or more compounds on secretion of mucins and O-linked 
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glycoproteins in cells. The Example utilizes MUCl but any mucin or 0-linked glycoprotein 
could be used with appropriate expression constructs, antibodies and reagents. The 
developed cell model and modifications hereof can be used for high throughput screens of 
inhibitors in combination with or as a second screen after the binding assays disclosed in 
5 Examples 2 and 3. 

8. GalNAcjS-benzyl inhibits mucin glycosylation and MUC5AC production in 
HT29MTXceUs. 

In order to evaluate GalNAcjS-benzyl as an inhibitor of mucin glycosylation and 
secretion we used the cultured colon carcinoma cells of mucin secreting phenotypes: HT- 

10 29 metotrexate (MTX) cells selected fi-om the HT-29 cell line by culture in the presence of 
MTX (kind gift from G. Huet, France). The HT-29 MTX cells were selected to yield a 
goblet cell like phenotype with constitutive production of MUC5AC. Li order to ensure 
proper goblet cell differentiation the HT29MTX cell line were continuously grown up to 21 
days in the presence of 5 mM GalNAca-benzyl, 5 mM GalNAcP-benzyl, 5 mM GlcNAca- 

1 5 benzyl and without inhibitor using culture conditions as described in Hennebicq-Reig et al. 
(Permanent exposm-e of mucin-secreting HT-29 cells to benzyl-N-acetyl-alpha-D- 
galactosaminide induces abnormal 0-glycosylation of mucins and inhibits constitutive and 
stimulated MUC5AC secretion. Biochem J. 1998;334:283-95). At 7, 14 and 21 days of 
continuous culture, media was collected and cells were fixed in 3% paraformaldehyde and 

20 stained for the presence of MUC5AC and specific carbohydrate structures using immuno- 
histochemistry with the following monoclonal antibodies: CLH2 recognizing MUC5 AC 
(Reis C. et al. hnmunohistochemical study of MUC5AC expression in human gastric 
carcinomas using a novel monoclonal antibody. Int. J. Cancer. 1997;74:1 12-21), 1E3 
recognizing Tn (GalNAca linked to serine or threonine), 3C9 recognizing T (GaljSl- 

25 3GalNAca linked to serine or threonine, and 3F1 recognizing Sialyl-Tn (NeuAca2- 
6GalNAca linked to serine or threonine. In order to evaluate the amoimt of MUC5AC 
secreted by HT-29 MTX cells with and without GaMAcjS-benzyl treatment, media were 
subjected to SDS-PAGE (Tris-NuPAGE Gels 14-21 %) followed by Westem blotting 
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using polyclonal rabbit anti human MUC5AC antibody LUM5-1 (Carlstedt et al 
MUC5 AC, but not MUC2, is a prominent mucin in respiratory secretions. Glycoconj J. 
1996;13:839-47). Furthermore the viscosity of the media from cells with and without 
inhibitor was evaluated using viscosity micro-measurement. 
5 Treatment of cells with GalNAcjS-benzyl and GalNAca-benzyl diminished 

secretion of MUC5AC as evaluated by western blot compared with GlcNAcjS-benzyl and 
cells treated with media alone (Fig. 17). Furthermore in accordance with earlier finding 
(Hennebicq-Reig et al. Permanent exposure of mucin-secreting HT-29 cells to benzyl-N- 
acetyl-alpha-D-galactosaminide induces abnormal 0-glycosylation of mucins and inhibits 

10 constitutive and stimulated MUC5AC secretion. Biochem J. 1998;334:283-95) treatment of 
cells with GalNAca-benzyl (Fig. 18, Panel A) induced a storage phenotype with increased 
intracellular staining of MUC5AC. In contrast, cells treated with GalNAcjS-benzyl (Fig. 18, 
Panel B) did not lead to any storage disease but diminished intracellular staining of 
MUC5AC compared with and GlcNAcjS-benzyl (Fig 18, panel C) and cells treated with 

15 media alone (Fig 1 8, panel D). In addition the O-linked mucin carbohydrate structures T, 
Tn and sialyl-Tn were upregulated in cells treated with GlcNAcjS-benzyl compared to 
controls. Media from GalNAcjS-benzyl treated cells were less viscous compared with 
control media (data not shown). In conclusion the novel inhibitor GalNAcjS-benzyl inhibits 
both O-linked mucin glycosylation and MUC5 AC secretion. 

20 
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